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2.  Sequential  HIV-1  isolates  from  three  individuals  were  obtained,  cloned,  and  mapped,  the  envelope  genes  sequenced. 
Detailed  molecular  analysis  showed  the  rate  of  evolution  for  the  envelope  gene  in  vivo  to  be  >.10'3  nucleotide 
substitutions  per  site  per  year,  a  rate  108  times  that  in  eukaryotic  genes  and  equal  to  that  in  influenza,  Visna,  and 
EIAV.  Changes  in  the  viral  genomes  consist  primarily  of  isolated  and  clustered  base  substitutions,  duplications,  and 
short  insertions  or  deletions  suggesting  that  reverse  transcriptase  misincorporatlon  and  copy-choice  misreading 
errors  are  responsible  in  large  part  for  variation  of  HIV-1  in  vivo  (Science  232:1548,  1986). 

3.  Computer-assisted  analysis  of  predicted  hydrophilicity,  surface  probability,  and  flexibility  identified  putative  antigenic 
sites  in  the  viral  envelope,  generally  in  hypervariable  regions  but  also  in  a  conserved  region  as  well  fJ.  Virology 
61:570,  1987). 

4.  Molecular  analysis  of  sequential  HIV-1  isolates  from  chronically-infected  individuals  demonstrated  for  the  first  time 
the  "quasispecies"  nature  of  HIV-1  in  vivo  and  parallel  evolution  and  persistence  of  complex  mixtures  of  genotypically 
distinct  viruses  during  persistent  infection  (Nature  334:440,  1988). 

5.  Isolation,  cloning,  and  sequencing  of  virus  from  an  infected  lab  worker  was  performed,  providing  evidence  for  down- 
regulation  of  viral  replication  in  vivo  yet  progressive  changes  in  the  immunodominant  envelope  sequences  (Science 
239:68,  1988). 

6.  A  biologically  unique  strain  of  HIV-2  (HIV-2/ST)  was  isolated  from  peripheral  blood  cells  of  a  healthy  Senegalese 
woman  and  shown  to  be  attenuated  in  its  ability  to  cause  cell  fusion  and  cell  killing  and  in  its  ability  to  be  transmitted 
cell-free.  A  full-length,  replication  competent  proviral  done  (AJSP4-27)  was  obtained  from  this  isolate,  shown  to  have 
the  same  phenotype  as  the  parental  isolate,  and  sequenced  in  its  entirety  (Science  240:1525,  1988;  J.  Viroloov 
64:890,  1990).  This  clone,  and  a  series  of  culture-derived  revertants  that  were  selected  for  cytopathic  activity,  now 
provide  a  substrate  for  structure-function  analysis  of  the  HIV-2  envelope  gene  and  protein. 

7.  The  molecular  identity  of  "HTLV-4"  was  shown  to  be  SIVMAC  (Nature  330:184, 1987).  A  novel  reading  frame  termed 
X,  present  in  SIVMAC  and  HIV-2  but  not  HIV-1,  was  identified  and  shown  to  encode  a  l4kD  protein  that  is  packaged 
in  the  virion  (J.  Viroioov  62:3501,  1988). 

8.  A  "nested"  PCR  amplification  technique  using  generic  or  "universal"  oligonudeotide  primer  pairs  was  developed  that 
led  to  the  detection,  cloning,  and  sequencing  of  the  first  SIVAQM  virus  strains  from  West  African  green  monkeys.  In 
addition,  because  of  the  increased  sensitivity  and  specificity  of  the  nested  PCR  technique,  this  could  be  done  using 
uncultured  peripheral  blood  mononuclear  cell  DNA  without  prior  in  vitro  cultivation  (manuscript  in  preparation). 

9.  Full-length,  replication  competent  HIV-1  proviral  clones  were  obtained  by  direct  lambda  phage  cloning  of  DNA  from 
uncultured  human  brain.  These  clones  are  the  first  such  clones  derived  directly  from  human  tissue  without  interim 
cell  culture  and  correspond  directly  to  the  viral  forms  present  in  vivo  in  infected  individuals.  The  progeny  virus 
derived  by  transfection  of  COS-1  cells  with  this  proviral  DNA  are  highly  replicative  and  monocyte  tropic  and  are  being 
characterized  in  regard  to  molecular  determinants  of  monocyte  tropism,  biologic  properties,  genomic  organization, 
gene  structure-function  relationships,  and  relevance  to  HIV-1  induced  neuropathogenesis  (manuscript  in  preparation). 
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When  contract  DAMD  17-87-C-7038  "Studies  ot  Genetic  Variation  in  the  AIDS  Virus: 
Relevance  to  Disease  Pathogenesis,  Antiviral  Therapy,  and  Vaccine  Development"  was  first 
initiated,  relatively  little  was  known  about  the  genetics  and  biology  of  the  HIV  and  SIV  groups 
of  viruses  and  their  relation  to  disease  pathogenesis  (8,50).  Over  the  relatively  short  period 
of  time  from  1981  to  1985,  15,000  cases  of  AIDS  had  been  diagnosed  in  the  United  States 
alone,  and  it  was  estimated  that  an  additional  40,000  cases  would  occur  in  the  U.S.  over  the 
next  two  years  (34,128).  It  was  further  estimated  that  up  to  1-2  million  Americans  had  already 
been  infected  with  the  HIV-1  virus,  and  it  was  not  known  what  proportion  of  these  would 
ultimately  develop  the  full-blown  AIDS  syndrome.  With  a  mortality  rate  approaching  100%, 
AIDS  was  recognized  as  clearly  one  of  the  most  serious  diseases  confronting  man  in  modern 
times.  There  were  a  number  of  reasons  that  AIDS  and  its  causative  agents,  HIV-1  and  HIV-2, 
were  of  prime  importance  to  the  U.S.  Armed  Forces  and  the  civilian  community  alike.  First, 
the  AIDS  virus  causes  a  spectrum  of  serious  and  often  fatal  diseases  and  is  transmitted 
through  blood  products  and  sexual  relations.  This  meant  that  a  great  many  persons,  including 
military  personnel  who  previously  were  not  included  in  high-risk  groups,  were  at  considerable 
risk  for  infection.  Second,  the  HIV  is  a  "geographic  agent"  such  that  in  certain  locales 
including  Africa,  Haiti,  and  various  U.S.  and  European  cities  its  prevalence  is  particularly  high. 
Third,  while  the  AIDS  epidemic  affects  persons  of  all  ages  it  is  particularly  prevalent  in  young, 
sexually  active  adult  men.  Fourth,  it  is  clear  that  HIV  can  be  readily  transmitted  through 
heterosexual  relations.  Fifth,  infection  by  HIV  has  a  definite  impact  beyond  clinically  defined 
of  AIDS.  For  example,  for  every  one  person  with  AIDS,  there  are  10  persons  with  an  AIDS- 
related  condition  (ARC)  such  as  lymphadenopathy,  diarrhea,  weight  loss,  hematologic 
abnormalities,  or  neurologic  dysfunction  and  there  are  100  more  with  asymptomatic  infection. 

Our  laboratory  had  been,  and  continues  to  be,  actively  involved  in  molecular  and 
genetic  studies  of  HIV-1,  HIV-2,  and  SIV.  We  reported  the  first  successful  cloning  of  HIV-1 
(65,145),  and  in  addition,  determined  that:  (i)  HIV-1  is  an  exogenous  human  retrovirus 
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approximately  10  kb  in  length  that  lacks  nuclei  acid  sequences  derived  from  normal  human 
DNA;  (ii)  the  virus  persists  in  cells  in  both  integrated  and  unintegrated  forms;  (iii)  different 
isolates  of  the  virus  exhibit  substantial  diversity,  or  heterogeneity,  in  their  restriction  enzyme 
cleavage  patterns;  and  (iv)  HIV-1  frequently  infects  the  brains  of  AIDS  patients  where  it  may 
lead  to  AIDS-associated  dementia  (65,66,69,143-145,164).  The  thrust  of  the  proposed  DoD 
work  for  contract  DAMD  17-87-C-7038  was  aimed  at  extending  these  studies.  The  reason  for 
placing  this  emphasis  on  the  genetic  variation  of  HIV  was  that  genetic  variation,  especially 
within  the  envelope  gene,  is  likely  critically  involved  in  the  virus’s  host-range  and  tissue 
specificity  and  in  its  ability  to  evade  host  defenses.  It  therefore  follows  that  a  thorough 
understanding  of  genetic  variation  will  be  essential  for  the  elucidation  of  viral  pathogenesis 
and  for  the  development  of  effective  vaccines,  anti-viral  agents,  and  diagnostic  reagents  for 
the  HIV/SIV  groups  of  viruses.  Specific  objectives  of  the  project  were: 

1.  To  determine  the  extent,  nature,  and  molecular  basis  of  HIV-1  genomic 
variability. 

2.  To  determine  the  rate  and  characteristics  of  HIV-1  genomic  variation  in 
chronically  infected  individuals  over  time. 

3.  To  analyze  the  deduced  amino  acid  sequence  of  the  envelope  of  HIV-1  for 
predicted  secondary  structure  and  antigenicity. 

4.  To  determine  the  molecular  and  biologic  properties  of  HIV-2  isolates  from 
individuals  in  West  Africa  where  HIV-2  appears  to  be  less  virulent  compared  with 
HIV-1  and  some  other  strains  of  HIV-2. 

5.  To  determine  the  molecular  basis  for  attenuation  of  cytopathicity  in  certain  strains 
of  HIV-2. 

6.  To  develop  PCR  techniques  for  sensitive  and  specific  amplification  of  novel 
HIV/SIV  genomes  using  universal  primers  and  "nested"  amplification  and  to  apply 
this  to  the  detection  and  characterization  of  SIV  and  HIV-2  in  uncultured  blood 
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of  West  African  green  monkeys  and  people. 

7.  To  molecularly  clone  full-length  replication  competent  HIV-1  proviruses  from 
uncultured  human  brain  specimens  for  analysis  of  virus  biology,  molecular 
determinants  of  tropism  and  virulence,  genomic  organization,  and  gene  structure- 
function  relationships. 

In  this  Final  Report,  work  done  to  complete  these  scientific  objectives  is  summarized. 

1.  To  determine  the  extent,  nature,  and  molecular  basis  of  HIV-1  genomic  variability. 

In  this  initial  phase  of  study,  molecular  clones  of  HIV-1  derived  from  virus  cultures  from 
blood  samples  of  a  Haitian  homosexual  man  (HAT-3),  a  Haitian  infant  (WMJ),  two  U.S. 
homosexual  men  (ARV-2  and  BH-10),  and  a  French  homosexual  man  (LAV-1)  were  studied 
(12,65,66,69,125,139).  The  genome  of  HIV-1  consists  of  long  terminal  repeat  (LTR)  elements; 
gag,  pol,  and  env  genes;  as  well  as  central  region  genes  vif,  vpr,  vpu,  tat,  and  rev  and  3’  orf 
for  3'  open  reading  frame  now  known  as  nef.  For  HAT-3,  we  determined  the  full  length 
proviral  sequence.  For  WMJ-1,  only  the  env  gene  was  sequenced,  and  for  two  other  WMJ 
clones  that  were  closely  related  to  WMJ-1,  both  env  and  gag  (pi  5  and  p24  regions)  were 
sequenced.  From  this  analysis,  and  from  restriction  enzyme  mapping  and  heteroduplexing, 
we  confirmed  that  the  overall  genomic  organization  of  HAT-3  and  WMJ-1  is  the  same  as  that 
described  previously  for  BH-10,  LAV-1,  and  ARV-2. 

Figure  1  compares  the  nucleotide  and  deduced  amino  acid  sequences  of  HAT-3, 
WMJ-1,  LAV-la,  and  ARV-2  to  the  original  HTLV-lllb  clone,  BH-10.  These  data  demonstrate 
the  existence  of  a  broad  spectrum  of  diversity  among  the  AIDS  retroviruses:  Overall,  BH-10 
differed  from  HAT-3  in  8.6%  of  nucleotides  (12.5%  amino  acids),  from  ARV-2  in  6.3%  of 
nucleotides  (9.2%  amino  acids),  and  from  LAV-la  in  1.5%  of  nucleotides  (2.2%  amino  acids). 
HAT-3  and  ARV-2  differed  from  each  other  over  the  lengths  of  their  sequenced  genomes  by 
9.3%  of  nucleotides  (14.2%  amino  acids).  A  similar  spectrum  of  diversity  among  independent 
HIV-1  isolates  has  been  reported  on  the  basis  of  restriction  enzyme  mapping  (13,164). 


6 


A 


The  distribution  of  sequence  differences  was  not  uniform  throughout  the  viral  genomes, 
instead,  when  divergent  genomes  were  compared,  changes  were  found  to  be  considerably 
more  prevalent  in  the  envelope  and  3’  orf  {net)  than  in  other  genes.  In  the  envelope  gene 
in  particular,  BH-10  differed  from  HAT-3  in  12.9%  of  nucleotides  (19%  amino  acids),  from 
ARV-2  in  9.8%  of  nucleotides  (14.0%  amino  acids).  In  the  same  region,  HAT-3  differed  from 
ARV-2  in  14.7%  of  nucleotides  (18.4%  amino  acids)  and  from  WMJ-1  in  11.3%  of  nucleotides 
(16.2%  amino  acids).  Within  the  envelope  gene,  the  signal  peptide  and  extracellular  portion 
were  the  regions  that  were  most  highly  divergent.  For  example,  the  extracellular  envelope 
domains  of  HAT-3,  WMJ-1,  and  ARV-2  differed  from  that  of  BH-10  in  21.4%,  18.7%  and 
17.0%  of  amino  acids  respectively.  The  two  Haitian  isolates,  HAT-3  and  WMJ-1,  differed  from 
each  other  in  this  region  by  21.6%  amino  acids  and  from  ARV-2  by  20.2%  and  20.8%, 
respectively. 

The  most  conserved  areas  in  the  genomes  of  HAT-3,  LAV-la,  BH-10,  and  ARV-2  were 
the  gag  and  pot  genes,  which  differed  among  the  viruses  in  less  than  6%  of  nucleotides  and 
less  than  7%  of  amino  acids.  In  these  regions,  nucleotide  sequence  changes  were  almost 
exclusively  due  to  point  mutations  in  contrast  to  env  where  clustered  nucleotide  changes 
involving  in-frame  deletions,  insertions,  and/or  duplications  were  common.  For  example, 
compared  to  BH-10,  there  were  48bp  of  insertions  and  21  bp  of  deletions  in  the  extracellular 
envelope  of  HAT-3  as  compared  to  only  6  bp  of  insertions  and  deletions  in  all  of  its  analyzed 
gag,  pol,  and  vif  sequences.  Similarly,  there  were  33  bp  of  insertions  and  36  bp  of  deletions 
in  the  extracellular  envelope  of  ARV-2  (compared  to  BH-10)  but  only  a  6  bp  insertion  and  no 
deletions  in  al  of  its  gag,  pol,  and  sor  {vif).  Even  in  LAV-la,  which  is  more  closely  related  to 
BH-10  than  are  the  other  sequenced  viruses,  there  was  a  15  bp  insertion  (duplication)  in  the 
exterior  envelope  gene  but  no  insertions  or  deletions  in  gag,  pol,  or  sor  {vif). 

Another  difference  in  the  types  of  mutations  present  in  the  env  gene  compared  to  the 
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gag  gene  of  these  viruses  was  in  the  proportion  of  silent  third  base  pair  changes.  Excluding 
deletions  and  insertions,  compared  to  BH-10  the  mutations  in  the  external  env  genes  of 
ARV-2,  HAT-3,  and  WMJ  (clone  WMJ-2  from  patient  WMJ)  were  in  the  third  position  of  the 
codon  in  36%,  37%  and  34%  of  instances  leading  to  32%,  25%  and  42%  amino  acid  changes, 
respectively.  Conversely,  in  the  gag  genes  of  these  same  viruses,  63%,  56%,  and  66%  of 
nucleotide  changes  were  in  the  third  place  of  the  codon  leading  to  10%,  3.5%  and  0%  amino 
acid  changes,  respectively.  Thus,  more  than  half  of  the  single  nucleotide  changes  in  env 
occurred  in  the  first  or  second  codon  position,  and  even  for  third  position  changes,  many  of 
these  led  to  amino  acid  changes.  Conversely,  only  a  minority  of  first  or  second  codon  position 
changes  occurred  in  gag,  and  in  this  gene  the  third  position  changes  were  almost  uniformly 
silent.  These  findings  are  consistent  with  two  interpretations;  first,  that  structure-function 
relationships  lead  to  stronger  conservation  of  amino  acid  sequence  in  gag  than  in  env,  and 
second,  that  non-silent  nucleotide  changes  in  env  may  actually  be  under  positive  selection 
pressure.  Whether  or  not  the  latter  interpretation  is  correct  must  await  a  much  more  complete 
understanding  of  the  host  immunologic  response  to  HIV-1. 

An  alignment  of  the  envelope  nucleotide  and  amino  acid  sequences  of  HAT-3,  WMJ-1 
BH-10,  LAV-1  a,  and  ARV-2  is  shown  in  Figures  2  and  3.  A  methionine  codon  at  position  8 
of  the  BH-10  envelope  open  reading  frame  most  likely  marks  the  beginning  of  the  gene,  with 
a  hydrophobic  region  between  position  17  and  37  representing  the  potential  leader  sequence, 
or  signal  peptide.  This  signal  peptide  is  cleaved  from  the  envelope  precursor  protein  during 
envelope  maturation  (5).  The  ensuing  peptide  sequence  from  position  38  to  518  (BH-10 
sequence)  is  slightly  hydrophilic  and  contains  numerous  potential  N-linked  glycosylation  sites, 
ranging  from  22  in  WMJ-1  to  29  in  HAT-3.  This  region  corresponds  to  the  major  exterior 
envelope  glycoprotein  (gp120).  A  characteristic  arginine-rich  hydrophobic  stretch  marks  the 
cleavage  site  for  the  processing  of  the  envelope  precursor  gp160  into  the  exterior  gp120  and 
membrane-bound  gp41  (5,42).  The  latter  protein  includes  an  apparent  hydrophobic  membrane- 
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spanning  segment,  a  hydrophilic  anchor  sequence,  and  an  additional  carboxy-terminal  stretch 
of  150  residues  whose  function  is  presently  unknown. 

A  striking  feature  of  the  five  envelope  sequences  was  the  conservation  of  cysteine 
residues  (Figure  3).  Within  the  extracellular  envelope,  each  of  the  1 8  cysteine  residues  was 
conserved  in  all  five  viruses.  Within  the  transmembrane  protein,  all  but  one  cysteine  residue 
in  its  extreme  carboxy-terminus  was  conserved.  This  finding  argues  for  a  highly  conserved 
"macrostructure"  of  different  HIV-1  envelope  glycoproteins.  Despite  this  overall  conservative 
structure  numerous  amino  acid  changes  were  evident  throughout  the  envelope  glycoprotein. 
In  the  transmembrane  region  these  were  generally  the  result  of  isolated  nucleotide  point 
mutations  leading  to  single  amino  acid  substitutions.  In  the  extracellular  region,  changes  in 
amino  acid  sequence  resulted  from  both  nucleotide  point  mutations  and  in-frame  deletions, 
insertions  or  duplications  of  blocks  of  nucleotides.  The  resulting  amino  acid  alterations 
appeared  as  clustered  mutations  interspersed  with  polypeptide  segments  that  were  highly 
conserved.  The  regions  of  highest  variability  (dark  shading  in  Figure  3)  and  highest 
conservation  (light  shading  in  Figure  3)  were  identified  both  by  visual  inspection  and  by 
objective  determination  of  relative  variation  using  computer  analysis  (Figure  4). 

Since  the  extracellular  region  of  the  HIV-1  envelope  glycoprotein  constitutes  a  major 
target  of  the  host  immunologic  response  (10),  we  examined  this  region  for  predicted  antigenic 
epitopes  using  a  computer  program  that  predicts  the  secondary  structure  of  proteins 
superimposed  with  values  for  hydrophilicity  (24,80).  Such  an  analysis  of  other  proteins, 
including  viral  envelopes  has  shown  that  antigenic  epitopes  are  often  associated  with 
hydrophilic  protein  domains  containing  p  turns  (7,32,46,62,121,160).  This  analysis 
demonstrated  that  the  exterior  envelope  proteins  of  the  five  AIDS  retroviruses  each  contain  a 
number  of  sites  that  meet  criteria  for  likely  antigenic  epitopes  and  that  these  regions  generally 
coincide  with  the  variable  regions  identified  independently  by  amino  acid  sequence 
comparisons.  Figure  5  illustrates  the  substantial  differences  in  predicted  secondary  protein 
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structure  for  the  first  variable  region  of  the  external  envelope  gene  product  for  five  viruses. 
In  this  region,  there  were  7  p  turns  in  HAT-3,  0  (5  turns  in  WMJ-1,  8  p  turns  in  BH-10,  11  p 
turns  in  LAV-1  a,  and  5  p  turns  in  ARV-2.  The  five  envelope  sequences  also  differed  in  this 
region  in  their  degrees  of  hydrophilicity  and  in  the  number  and  location  of  potential  N-linked 
glycosylation  sites  (Figures  2  and  3).  In  each  of  the  other  variable  regions  substantial 
differences  existed  similarly  in  predicted  hydrophilicity,  secondary  structure,  and  potential 
glycosylation  sites. 

Interspersed  with  the  variable  regions  of  the  exterior  envelope  protein  were  other  areas 
that  were  highly  conserved  among  all  five  isolates  analyzed.  Most  of  these  regions  were 
primarily  hydrophobic  and  contained  only  few  or  no  p  turns.  Accordingly,  they  are  believed 
to  be  less  likely  to  represent  antigenic  epitopes.  There  were,  however,  a  number  of 
exceptions.  One  such  exception  is  a  conserved  stretch  of  46  amino  acid  residues  immediately 
adjacent  to  the  processing  site  of  the  envelope  precursor  (amino  acids  473  through  518  in 
Figure  3).  This  area  was  very  highly  conserved  among  all  five  viruses,  contained  numerous 
P  turns,  and  was  hydrophilic.  Thus  this  conserved  region  of  the  exterior  envelope  glycoprotein 
would  be  expected  to  be  both  antigenic  and  cross-reactive  among  different  viral  strains. 

2.  To  determine  the  rate  and  characteristics  of  HIV-1  genomic  variation  in 

chronically  infected  Individuals  over  time. 

The  results  shown  in  Figures  1-5  indicated  that  (i)  within  the  extracellular  env  gene 
there  are  localized  regions  of  hypervariabiiity  interspersed  with  regions  of  strong  conservation; 
(ii)  genetic  changes  among  different  viruses  result  largely  from  duplications,  insertions,  or 
deletions  of  short  stretches  of  nucleotides  as  well  as  from  an  accumulation  of  nucleotide  point 
mutations;  and  (iii)  hypervariable  env  regions,  and  some  constant  regions,  have  properties 
predictive  of  antigenicity  based  on  analysis  of  predicted  secondary  protein  structure, 
hydrophilicity,  and  glycosylation  pattern.  To  define  further  the  extent,  rate  and  nature  of 
genetic  variation  of  HIV-1  in  vivo  we  have  studied  sequential  virus  isolates  from  three 
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persistently  infected  patients  with  AIDS  or  risk  factors  for  AIDS.  The  isolates  were  obtained 
on  four  to  six  occasions  over  a  1-  or  2-year  period.  The  first  patient  (coded  WMJ)  was  a 
Haitian  child  with  AIDS  who  was  born  in  Miami,  Florida,  in  1982  and  infected  perinatally  by 
her  HIV-1 -positive  mother.  The  second  patient  (coded  WMF)  was  a  34-year-old  Haitian  woman 
with  AIDS-related  complex  (ARC)  who  was  also  from  Miami  and  whose  only  known  risk  factor 
for  HIV-1  infection  was  heterosexual  exposure.  The  third  patient  (coded  RJS)  was  a  31 -year- 
old  American  homosexual  man  from  California  who,  despite  chronic  HIV-1  infection,  remained 
clinically  well  for  5  years  but  eventually  developed  evidence  of  T-cell  dysfunction  and  died  of 
AIDS.  He  reported  having  sexual  encounters  with  at  least  1000  different  men  between  1980 
and  1985.  These  three  subjects  thus  represented  a  broad  clinical  spectrum  of  HIV-1-related 
disease  (AIDS,  ARC,  and  clinically  normal  virus  carrier),  different  modes  of  virus  transmission 
(perinatal,  heterosexual,  and  homosexual),  and  widely  different  numbers  of  exposures  to  the 
AIDS  virus  (presumably  one  for  WMJ,  limited  encounters  for  WMF,  and  very  large  numbers 
for  RJS).  All  virus  isolations  were  made  from  the  patients'  peripheral  blood  mononuclear  cells 
(PBMC)  that  were  cocultivated  with  normal  PBMC  and,  in  most  instances,  transmitted  to 
immortalized  T  cell  lines. 

Figure  6,  shows  that,  for  patient  WMJ  the  Sst  I,  EcoRI,  and  Pst  I  restriction  enzyme 
patterns  for  each  of  the  five  isolates  were  identical  and  that  the  Bgl  II  and  Pvu  II  patterns  were 
almost  identical.  With  Hind  III,  the  isolates  were  clearly  distinct  but  still  highly  related;  for 
example,  the  loss  of  a  single  Hind  III  site  in  WMJ-1  led  to  the  Hind  III  pattern  observed  in 
WMJ -2  (See  Figure  7).  In  contrast,  the  enzyme  cleavage  patterns  of  the  five  WMJ  isolates, 
which  as  a  group  were  quite  similar  to  each  other,  differed  markedly  from  viruses  of  patients 
WMF  and  RJS  as  well  as  from  all  other  patients’  viruses  tested.  The  observation  that  serial 
virus  isolates  from  a  single  patient  (WMJ)  were  more  highly  related  to  each  other  than  to 
viruses  from  other  patients  was  also  true  for  isolates  from  WMF  and  RJS.  For  example,  the 
four  isolates  from  WMF  were  identical  in  Sst  I  and  EcoRI  patterns  and  nearly  identical  when 
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analyzed  with  Pvu  II  and  Pst  I.  Moreover,  the  differences  with  Hind  III  could  be  explained 
simply  by  either  the  loss  or  gain  of  single  restriction  sites.  For  RJS,  all  six  isolates  were 
identical  with  Sst  I  and  Eco  Rl  and  nearly  identical  with  Bgl  II,  Pvu  II,  and  Pst  I. 

For  most  virus  isolates  shown  in  Figure  6,  the  sizes  of  the  restriction  fragments 
generated  by  any  given  enzyme  added  up  to  9  kilobases,  the  approximate  genomic  size  of 
HIV-1.  However,  for  certain  virus  isolates  from  patients  WMF  and  RJS,  some  of  the  restriction 
enzymes  generated  fragments  that  added  up  to  more  than  one  genomic  equivalent  (for 
example,  Bgl  II,  Pvu  II,  and  Pst  I  digestions  of  RJS-4;  the  Hind  III  patterns  of  RJS-2,  RJS-5, 
and  RJS-6;  the  Bgl  II  of  WMF-1;  and  the  Pst  I  pattern  of  WMF-3).  These  particular  isolates 
were  therefore  composed  of  a  mixture  of  more  than  one  highly  related,  yet  genotypically 
distinct,  virus  populations.  That  the  restriction  digestions  did,  in  fact,  reflect  accurately  the 
presence  of  more  than  one  predominant  viral  genotype  and  were  not  reflecting  incomplete 
digestions  was  shown  by  the  following:  (i)  several  different  enzymes  generated  bands  whose 
additive  size  in  each  case  was  greater  than  9  kb;  (ii)  repeated  digestions  of  these  same 
DNA’s  with  excess  enzyme  gave  identical  result;  and  (iii)  molecular  clones  of  viral  DNA 
obtained  from  several  of  these  DNA’s  corresponded  to  the  different  viral  patterns  apparent  on 
the  blot  hybridizations. 

Although  more  than  one  predominant  viral  genotype  was  present  in  some  virus  isolates, 
all  of  the  genotypically  distinct  viral  forms  in  any  one  patient  were  still  very  similar  to  each 
other  as  opposed  to  independent  virus  isolates  from  different  individuals  that  were  distinct. 
For  example,  although  two  viral  forms  could  be  distinguished  by  the  Bgl  II  digestion  of  WMF-1, 
these  two  viral  forms  were  identical  in  their  Sst  I,  Hind  III,  EcoRI,  Pvu  II,  and  Pst  I  patterns. 
Similarly,  the  two  viral  forms  distinguishable  by  Hind  III  in  RJS-2,  RJS-5,  and  RJS-6  were 
identical  in  Sst  I,  EcoRI,  Pvu  II,  and  Pst  I  patterns.  By  way  of  contrast,  the  WMF  viruses  and 
RJS  viruses,  as  groups,  differed  from  each  other  dramatically  in  Sst  I,  Hind  III,  Bgl  II,  Pvu  II, 
and  Pst  I  patterns.  Thus,  within  any  given  individual,  even  one  presumably  exposed  to 
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numerous  different  AIDS  viruses,  only  one  or  a  very  limited  number  of  virus  strains,  as  defined 
by  restriction  enzyme  analysis,  could  be  detected  by  virus  isolation  and  DNA  blot  hybridization. 
It  is  possible,  of  course,  that  these  patients  were  actually  infected  with  many  different 
genotypic  viral  forms  but,  because  of  their  unequal  growth  characteristics,  most  were  not 
evident  after  cultivation  in  vitro.  We  cannot  rule  out  this  possibility,  but  it  is  of  note  that  work 
involving  Southern  blot  hybridization  analyses  of  uncultured  brain  and  lymphoid  tissues  from 
AIDS  patients  also  indicates  the  presence  of  only  one  or  a  very  limited  number  of  predominant 
viral  forms  in  situ  (135,144). 

To  define  in  greater  detail  the  nature  of  genetic  changes  in  HIV-1  that  occur  in  infected 
patients  over  time,  we  used  subgenomic  probes  to  map  the  restriction  enzyme  cleavage 
patterns  of  isolates  WMJ-1  through  WMJ-5  (Figure  7).  In  addition,  the  proviral  genomes  of 
WMJ-1 ,  WMJ-2,  and  WMJ-3  were  molecularly  cloned  and  the  nucleotide  and  deduced  amino 
acid  sequences  of  their  env  genes  determined  (Figure  8).  The  restriction  maps  in  Figure  7 
represent  the  restriction  cleavage  patterns  of  proviral  DNA  for  each  of  the  five  WMJ  isolates 
shown  in  Figure  6.  The  results  indicated  that  each  of  the  five  WMJ  isolates  that  were 
obtained  over  a  1-year  period  were  highly  related  to  each  other,  differing  in  only  one  or  a  few 
restriction  sites.  For  example,  out  of  29  restriction  sites  tested,  WMJ-2  differed  from  WMJ-1 
in  only  four  sites  and  from  WMJ-3,  WMJ-4,  and  WMJ-5  in  three,  one,  and  one  sites, 
respectively.  Of  particular  note  was  the  finding  that  although  the  five  WMJ  virus  isolates  were 
obtained  sequentially,  these  viruses  had  apparently  not  evolved  genetically  in  such  a  direct 
fashion,  that  is,  from  WMJ-1  to  WMJ-2  to  WMJ-3  and  so  on.  This  could  be  deduced  from  the 
restriction  maps  by  observing  the  nondirectional  changes  in  certain  of  the  Pvu  II,  Hind  III,  Kpn 
I,  and  Bgl  II  sites  (Figure  7). 

That  isolates  WMJ-1,  WMJ-2,  WMJ-3,  WMJ-4,  and  WMJ-5  had  evolved  in  parallel  was 
confirmed  by  determining  and  comparing  the  env  nucleotide  sequences  of  molecular  clones 
of  proviral  DNA  from  WMJ-1,  WMJ-2,  and  WMJ-3  (Figure  8).  Overall,  the  env  of  WMJ-1 
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(clone  A.WMJ-1-V)  differed  from  WMJ-2  (clone  A.WMJ-2-III)  in  2.8%  of  nucleotides  and  that  of 
WMJ-2  differed  from  WMJ-3  (clone  AWMJ-3-III)  in  3.1%  of  nucleotides.  If  WMJ-3  had  evolved 
directly  from  WMJ-2,  which  had  in  turn  evolved  from  WMJ-1 ,  one  would  have  expected  to  find 
approximately  5%  to  6%  nucleotide  differences  between  WMJ-3  and  WMJ-1.  This  was  not 
the  case  because  WMJ-3  differed  from  WMJ-1  by  only  2.3%  of  nucleotides.  Direct  inspection 
of  the  nucleotide  alignment  of  WMJ-1,  WMJ-2  and  WMJ-3  (Figure  8)  also  revealed  many 
examples  where  WMJ-1  was  more  similar  to  WMJ-3  than  to  WMJ-2,  and  vice  versa.  Of 
importance,  the  restriction  patterns  of  each  of  the  three  WMJ  proviral  clones  that  were 
sequenced  corresponded  exactly  (20  out  of  20  restriction  sites)  to  the  predominant  genomic 
proviral  forms  for  WMJ-1,  -2,  and  -3  present  in  infected  cells  and  shown  in  Figures  6  and  7. 
Furthermore,  between  5  and  20  proviral  DNA  clones  were  obtained  from  the  recombinant 
libraries  or  each  isolate,  and  again,  the  ones  chosen  for  sequence  analysis  were  full-length 
and  representative  of  the  majority.  Finally,  to  be  certain  that  the  observed  nucleotide 
differences  among  the  clones  were  not  an  artifact  due  to  technical  procedures,  we  sequenced 
a  portion  of  the  env  gene  of  two  proviral  clones  from  the  same  library.  These  two  clones 
(AWMJ-2-III  and  A.WMJ-2-XIII),  both  isolated  from  a  recombinant  A.  phage  library  of  WMJ-2 
DNA,  were  identical  in  1220  out  of  1221  nucleotides  sequenced.  Thus,  the  sequenced  proviral 
clones  were  representative  of  the  predominant  viral  forms  in  each  of  the  isolates,  and 
observed  sequence  differences  reflected  variation  in  the  HIV-1  env  gene. 

Since  it  was  shown  that  WMJ-1,  WMJ-2,  and  WMJ-3  were  highly  related  and  had 
evolved  from  a  common  progenitor,  and  since  the  date  of  infection  of  patient  WMJ  by  HIV-1 
was  known,  it  was  possible  to  determine  the  approximate  rate  of  genetic  evolution  for  the 
AIDS  virus  during  a  natural  infection.  Many  data  documenting  the  rates  of  evolution  of  other 
RNA  viruses  as  well  as  DNA  genomes  had  led  to  the  conclusion  that  RNA  viruses  in  general 
evolved  at  a  rate  much  greater  than  DNA  genomes  (43,79).  For  calculating  the  rate  of  genetic 
change  for  HIV-1,  we  assumed  that  WMJ-1,  WMJ-2,  and  WMJ-3  had  all  evolved  from  a 


14 


common  progenitor  virus  sometime  within  the  preceding  5  years.  This  supposition  was  based 
on  the  following  information  (i)  Patient  WMJ,  who  had  only  a  single  perinatal  exposure  to 
HIV-1,  was  2  years  old  when  isolates  1,  2,  and  3  were  obtained,  (ii)  WMJ-1,  -2,  and  -3  were 
but  three  of  five  highly  related  viruses  isolated  from  this  same  patient,  (iii)  All  five  WMJ 
viruses  were  highly  related  to  each  other  and  very  different,  as  a  group,  from  other 
independent  viruses  that  we  had  evaluated  (including  five  other  isolates  from  Haitian  individuals 
living  in  Miami),  (iv)  Within  the  hypervariable  envelope  regions,  the  nucleotide  sequences  of 
WMJ-1 ,  -2,  and  -3  were  very  similar  to  each  other  and  dramatically  different  from  those  of 
isolates  from  other  patients.  These  data,  taken  together,  indicate  that  the  five  WMJ  isolates 
all  evolved  from  a  common  progenitor  virus.  Conceivably,  patient  WMJ  could  have  been 
infected  perinatally  with  more  than  one  virus.  Even  if  this  were  so,  however,  these  viruses 
would  still  have  had  to  evolve  in  the  immediate  past  from  a  common  progenitor  virus  present 
in  WMJ’s  mother. 

Using  1  to  5  years  as  an  estimate  of  the  time  since  divergence  of  WMJ-1 ,  WMJ-2,  and 
WMJ-3  from  a  common  point,  we  calculated  the  rates  of  evolution  for  the  env  and  gag  genes 
of  HIV-1  as  follows: 

R  =  D/2T 

where  T  is  the  rate  of  nucleotide  substitutions  per  site  per  year  and  T  is  the  time  since 
divergence  of  the  viruses 

D  =  -(3/4)  In  [1  -  (4/3) P] 

where  P  is  the  proportion  of  different  nucleotides  between  the  homologous  genes.  From  this 
equation  and  the  determined  nucleotide  sequences  of  WMJ-2  and  WMJ-3,  the  rate  of  evolution 
(R)  for  the  HIV-1  env  gene  was  estimated  to  be  between  1.58  x  10'2  and  3.17  x  10"3 
nucleotide  substitutions  per  site  per  year  for  T  values  of  1  year  and  5  years,  respectively. 
Similarly,  the  rate  of  genetic  mutation  for  the  gag  p24  and  pi  5  genes  was  estimated  to  be 
between  1.85  x  10'3  and  3.70  x  104.  This  rate  of  genetic  change  for  the  AIDS  virus  is  more 
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than  a  millionfold  greater  than  for  most  DNA  genomes  and  may  even  be  tenfold  greater  than 
from  some  other  RNA  viruses  including  certain  retroviruses  and  influenza  A  virus. 

We  also  compared  the  nature  and  distribution  of  nucleotide  changes  within  the  env 
genes  of  sequentially  isolated  viruses  from  persistently  infected  individuals  with  those  in 
independent  virus  isolates  from  different  patients.  The  changes  in  the  env  genes  of  WMJ-1, 
WMJ-2,  and  WMJ-3  were  almost  exclusively  nucleotide  point  mutations.  This  corroborates  our 
earlier  findings  with  independent  virus  isolates  that  single  and  clustered  base  pair  substitutions, 
presumably  resulting  from  errors  in  transcription,  are  an  important  source  of  variation  of  HIV-1. 
In  addition  to  point  mutations,  however,  there  were  also  three  small  (3  bp)  deletions  in  WMJ- 
2  and  WMJ-3  compared  to  WMJ-1  (figure  3).  Their  presence  underscores  the  importance  of 
nucleotide  deletions  or  insertions  as  well  as  substitutions  as  a  mechanisms  for  genomic 
change  in  the  AIDS  virus,  a  conclusion  also  supported  by  analysis  of  independent  HIV-1 
isolates.  The  distribution  of  nucleotide  substitutions  and  deletions  or  insertions  in  the  three 
WMJ  isolates  was  not  uniform  throughout  the  env  gene.  Instead,  such  changes  were 
clustered,  coinciding  with  hypervariable  regions  previously  identified  in  five  independent  AIDS 
virus  isolates  (see  Figure  9). 

In  the  studies  of  HIV-1  variation  described  in  Section  1,  we  observed  that  independent 
as  well  as  sequential  virus  isolates  from  individual  patients  showed  evidence  of  rapid  genetic 
change.  Genotypic  analysis  of  paired  HIV-1  isolates  from  donor-recipient  transfusion  cases 
(148)  and  nucleotide  sequence  comparisons  of  geographically  distinct  HIV-1  strains  (3) 
provided  further  evidence  for  the  highly  variable  nature  of  HIV-1.  However,  the  spectrum  of 
HIV-1  variation  in  chronically  infected  individuals,  the  mechanisms  underlying  this  variation,  and 
its  biological  consequences  still  remained  unknown.  The  experiments  described  in  Figures  6-9 
provide  insights  into  this  area.  To  address  this  question  further,  we  analyzed  a  group  of 
molecular  proviral  clones  from  sequential  virus  isolates  from  two  infected  individuals  (RJS  and 
WMF)  with  HIV-1  associated  clinical  disease.  Recombinant  lambda  phage  libraries  of  one  RJS 
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isolate  (RJS4)  and  two  WMF  isolates  (WMF1  and  WMF3)  were  prepared  so  that  the  viral  DNA 
molecules  which  in  toto  comprise  the  hybridization  patterns  of  the  overall  isolate  DNA  (Figure 
6)  could  be  dissected  and  analyzed  individually.  A  total  of  27,  17,  and  18  full-length  HIV-1 
clones  were  obtained  from  the  three  libraries  (Figure  10).  Careful  restriction  enzyme  mapping 
using  11  endonucleases  revealed  that  17  of  the  27  RJS4  clones,  10  of  the  17  WMF1  clones, 
and  13  of  the  18  WMF3  clones  were  distinguishable  by  unique  cleavage  patterns.  The 
remarkably  large  number  of  distinguishable  viral  clones  within  each  isolate  resulted  from 
various  combinations  of  restriction  site  polymorphisms  distributed  throughout  the  viral  genomes. 
As  expected,  the  restriction  patterns  of  the  predominant  HIV-1  clones  corresponded  with  the 
fragments  visualized  on  blot-hybridizations  of  the  isolate  DNA  from  which  they  were  cloned. 
Because  each  restriction  enzyme  generated  only  one  to  three  different  fragmentation  patterns, 
the  many  unique  viral  genotypes  shown  in  this  report  to  comprise  a  virus  isolate  had 
previously  gone  unrecognized. 

Inspection  of  the  genomic  restriction  patterns  of  the  HIV-1  clones  in  Figure  10  showed 
that  clones  from  individual  virus  isolates  (RJS4a-q;  WMFIa-j;  WMFIa-m)  were  considerably 
more  similar  to  other  clones  within  that  isolate  than  to  unrelated  clones  such  as  HXB2,  LAV- 
Mal,  LAV-Eli,  ARV2,  and  WMJ1.  In  order  to  quantify  objectively  the  extent  of  similarity  among 
the  different  viral  clones,  a  pairwise  comparison  of  the  percentage  of  restriction  site  differences 
between  each  of  the  45  viral  genotypes  shown  in  Figure  10  was  performed  (total  of  1,035 
independent  comparisons;  Figure  11).  Such  an  analysis  of  restriction  site  differences  between 
related  clones  is  a  valid  means  for  estimating  overall  nucleotide  sequence  variability  (40,91). 
Assuming  that  the  loss  of  a  restriction  site  in  otherwise  highly  related  genomes  results  from 
a  change  in  a  single  nucleotide,  a  50%  difference  in  restriction  sites  when  using  enzymes  that 
recognize  6  base  pair  sequences  corresponds  to  approximately  8%  nucleotide  sequence 
difference  (i.e.,  1  nucleotide  change  out  of  every  12  nucleotides  sampled).  We  used  this 
approach  first  to  analyze  clones  of  HIV-1  for  which  nucleotide  sequence  information  was 
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available,  and  the  correlation  between  nucleotide  sequence  differences  and  restriction  site 
differences  was  found  to  be  quite  good.  For  example,  clone  HXB2  differs  from  clones  BH10, 
ARV2,  and  LAV-Eli  by  13%,  41%,  and  62%  in  restriction  sites  (ref.  145  and  Figure  11).  Based 
on  these  restriction  site  differences,  the  predicted  nucleotide  differences  would  be  2.2%,  6.8%, 
and  1 0.5%.  The  actual  differences  determined  by  nucleotide  sequence  comparisons  are  1 .6%, 
5.8%,  and  9.7%,  respectively  (59).  Similarly,  clones  of  WMJ2  and  WMJ3,  both  of  which  were 
derived  from  a  single  HIV-1  infected  patient,  differ  by  12%  in  restriction  sites  and  by 
approximately  1-2%  in  nucleotide  sequence  (69).  Clones  of  LAV-Mal,  LAV-Eli,  and  ARV2 
differ  from  each  other  by  58  -  70%  in  restriction  sites  (Figure  11)  and  by  10.1  to  13.0%  in 
nucleotide  sequence.  In  the  present  study,  the  restriction  cleavage  site  differences  among  all 
45  clones  ranged  from  3%  to  70%.  The  viral  DNA  genomes  comprising  isolates  RJS4,  WMF1 
and  WMF3  were  considerably  more  similar  to  other  viral  genomes  from  within  the  same  isolate 
than  to  viral  genomes  from  unrelated  (independent)  isolates.  For  example,  the  17  different 
RJS4  genotypes  varied  from  each  other  by  3  -  28%  (mean  of  13%)  whereas  the  same  clones 
varied  from  independent  viruses  by  41  -  70%  (mean  of  55%;  p<0.0001).  The  10  different 
WMF1  genotypes  varied  from  each  other  by  3  -  33%  (mean  of  13%)  and  from  independent 
viruses  by  34  -  64%  (mean  of  54%;  p<0.0001).  The  13  different  WMF3  genotypes  varied  from 
each  other  by  4  -  29%  (mean  of  13%)  and  from  independent  viruses  by  38  -  65%  (mean  of 
56%;  pcO.0001).  These  data  show  that  while  considerable  genotypic  diversity  of  viruses  exists 
within  a  given  isolate,  these  multiple  distinguishable  viral  forms  have  all  clearly  evolved  from 
one  another,  or  from  common  precursor  viruses,  and  do  not  represent  concomitant  infection 
by  unrelated  (independent)  viruses. 

Isolates  WMF1  and  WMF3  were  derived  from  cultures  of  peripheral  blood  lymphocytes 
of  the  same  individual  taken  16  months  apart.  If  the  viral  forms  present  in  WMF3  had  evolved 
sequentially  from,  or  in  parallel  with,  the  viruses  present  in  WMF1,  the  extent  of  similarity 
between  WMF1  and  WMF3  viral  genomes  should  be  intermediate  between  values  for  clones 
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from  within  each  isolate  and  clones  from  independent  (unrelated)  isolates.  If  superinfection 
with  unrelated  viruses  had  occurred,  genotypes  of  WMF1  and  WMF3  viruses  would  be 
expected  to  vary  to  the  same  extent  as  the  unrelated  viruses  HXB2,  LAV-Mal,  LAV-Eli,  ARV2, 
WMJ1,  RJS4a,  and  WMFIa.  The  data  in  Figure  11  show  that  the  former,  not  the  latter,  was 
the  case.  The  viral  clones  WMF1  (a-j)  differed  from  clones  WMF3  (a-m)  by  11  -  44%  (mean 
of  23%),  an  intermediate  value  statistically  different  from  those  for  viruses  from  within  single 
isolates  (mean  of  13%;  p  <  0.0001)  or  unrelated  (independent)  isolates  (mean  of  55%;  p  < 
0.0001).  These  data,  plus  the  unique  similarities  among  WMF1  and  WMF3  clones  depicted 
in  Figure  10,  indicate  that  the  viruses  comprising  the  latter  isolate  evolved  either  from  genomes 
present  in  the  former  isolate  or  from  immediate  precursor  genomes  common  to  both.  The 
absence  of  clones  in  WMF3  that  are  identical  to  clones  in  WMF1,  and  the  existence  of  certain 
viral  genotypes  in  WMF1  and  WMF3  that  differ  by  as  much  as  39%  in  restriction  sites  (e.g., 
WMFIh  and  WMF3m),  underscores  the  extremely  rapid  rate  of  change  of  HIV-1  in  vivo. 

As  a  control  for  these  studies,  and  to  determine  the  extent  to  which  in  vitro  changes 
in  HIV-1  could  contribute  to  the  observed  genetic  differences  in  RJS  and  WMF  isolates,  we 
determined  the  extent  of  HIV-1  genotypic  variation  occurring  during  cultivation  and  amplification 
of  virus  from  infected  patient  tissues.  Peripheral  blood  mononuclear  cells  (PBMC)  obtained 
from  patient  RJS  at  the  same  time  that  isolate  RJS4  had  been  obtained  were  divided  into  two 
parts  and  each  was  co-cultivated  with  PHA-stimulated  PBMC  from  different  HLA-unrelated 
normal  donors.  The  restriction  cleavage  patterns  of  the  two  virus  isolates  obtained  after  6 
weeks  of  culture  were  identical  (Figure  12A),  implying  that  the  virus  populations  isolated  were 
representative  of  viruses  present  in  vivo.  PBMC  taken  from  the  same  patient  two  and  four 
weeks  after  the  first  phlebotomy  yielded  virus  populations  that  were  genetically  related  to  the 
first  isolates  yet  distinguishable  (Figure  12B),  reflecting  the  recognized  genotypic  heterogeneity 
of  HIV-1  in  vivo.  Finally,  the  genotypic  pattern  of  virus  cultured  from  the  brain  of  an  HIV-1 
infected  patient  (BB)  was  compared  to  viral  DNA  extracted  directly  from  an  uncultured 
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specimen  of  the  same  tissue.  The  restriction  patterns  were  identical  (Figure  12C)  indicating 
that  the  overall  genotypic  pattern  of  virus  isolated  by  short-term  culture  corresponds  to  virus 
populations  actually  present  in  vivo.  In  a  separate  set  of  experiments,  a  recombinant  lambda 
phage  library  identical  to  those  prepared  for  RJS4,  WMF1,  and  WMF3  was  prepared  from  an 
isolate  of  HIV-1  that  had  been  biologically-cloned  by  five  sequential  cell-free  end-point  dilutions 
of  culture  supernatant.  This  was  done  to  obtain  a  genotypically-pure  virus  stock  that  could 
then  be  expanded  in  culture,  as  is  done  during  the  isolation  of  HIV-1  from  human  tissues,  in 
order  to  determine  how  much  genotypic  variation  is  introduced  during  virus  amplification  in 
vitro.  From  this  library,  10  full-length  viral  DNA  clones  were  identified  and  these  were 
identical  in  270  out  of  273  restriction  sites  mapped  (average  of  27  sites  per  clone;  data  not 
shown).  This  degree  of  restriction  enzyme  site  variability  (1%)  in  a  biologically-cloned  isolate 
that  was  amplified  through  multiple  rounds  of  replication  was  significantly  less  than  for  the 
isolates  RJS4  (13%),  WPF1  (13%),  and  WPF3  (13%)  (Figure  11).  These  data  thus  suggest 
that  the  extensive  genotypic  variation  identified  within  naturally-occurring  isolates  of  HIV-1 
described  in  the  present  study  are  not  the  result  of  changes  occurring  during  in  vitro  virus 
propagation  or  molecular  cloning,  although  it  is  recognized  that  under  conditions  of  selective 
pressure  significant  genetic  changes  in  the  HIV-1  genome  can  and  do  occur  in  vitro  (162). 

The  results  of  this  study  indicate  that  genotypic  variation  of  HIV-1  in  vivo  is  rapid  and 
extensive,  that  numerous  variant  viral  forms  coexist  over  time  within  the  same  patient,  and  that 
"isolates"  of  HIV-1  actually  consist  of  complex  mixtures  of  genotypically-distinct,  albeit  related, 
viruses.  The  present  study  also  indicates  that  during  natural  infection  by  HIV-1,  different  viral 
genomes  evolve  in  parallel  and  result  in  the  emergence  and  persistence  of  multiple  distinct 
genotypic  forms. 

Variation  of  HIV-1  has  parallels  in  other  lentiviral  systems  including  equine  infectious 
anemia  virus  (EIAV),  visna  virus,  and  simian  immunodeficiency  virus  (SIV)  (119,120,154).  For 
EIAV,  it  is  clear  that  genotypic  changes  are  responsible  for  biologically  important  alterations 
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in  viral  antigenicity  which  allow  the  virus  to  elude  host  immune  defenses  (113).  For  visna,  the 
significance  of  antigenic  variation  is  less  clear  (154).  In  the  SIV  system,  a  virus  strain  has 
recently  been  isolated  from  a  pig-tailed  macaque  that  possesses  altered  biologic  and  antigenic 
properties  leading  to  a  broader  host-range  and  a  rapid,  fatal  immunodeficiency  syndrome 
several  weeks  after  inoculation.  There  are  indications  that  genotypic  variation  in  HIV-1  is 
similarly  associated  with  potentially  important  biologic  differences  among  variant  forms.  We 
have  examined  the  biologic  properties  of  viruses  derived  by  transfection  of  full-length  WMF1.16 
and  WMF3.3  clones  (patterns  WMFIa  and  WMF3c  in  Figure  10)  and  found  them  to  be 
indistinguishable  from  wild-type  virus  in  T-cell  tropism  and  cytopathicity  (data  not  shown).  We 
have  also  examined  the  biologic  properties  of  hybrid  viruses  constructed  by  exchanging  the 
envelope  region  of  six  different  RJS4  clones  (numbers  6,  15,  16,  22,  24,  and  26)  into  the 
transfection-competent  HIV-1  prototype  clone  HXB-2  (51)  and  have  found  that  the  progeny 
viruses  exhibit  striking  biological  differences.  These  findings  are  especially  intriguing  in  light 
of  recent  findings  by  Mullins  and  co-workers  who  showed  that  the  envelope/LTR  region  of  a 
replication-defective  variant  of  feline  leukemia  virus  (FeLV),  when  introduced  into  a  replication 
competent  construct  of  FeLV,  was  responsible  for  inducing  a  highly  reproducible  fatal 
immunodeficiency  illness  in  cats  (118).  Furthermore,  Gartner  and  Popovic  have  shown  that 
some  isolates  of  HIV-1  preferentially  replicate  in  mononuclear  phagocytes  whereas  others  show 
preference  for  T-lymphocytes  (58),  and  Koyanagi  and  Chen  have  identified  genotypically- 
distinct  virus  that  replicate  in  mononuclear  phagocytes  and  brain  glioma  explant  cultures  (96). 
Our  results  demonstrate  the  extreme  plasticity  of  the  HIV-1  genome  as  a  genetic  substrate  for 
important  biological  variation.  Finally,  the  observation  that  HIV-1  "isolates"  generally  consist 
of  complex  mixtures  of  genotypically-distinct  viruses  is  in  keeping  with  the  general  concept  of 
RNA  viruses  as  "quasispecies"  (43,150).  Experiments  examining  the  genetic,  biologic,  and 
immunologic  properties  of  HIV-1  will  need  to  account  for  this  extensive  genotypic  heterogeneity, 
and  in  some  instances  may  require  the  use  of  biologically  or  molecularly  cloned  viruses. 
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3.  To  analyze  the  deduced  amino  acid  sequence  of  the  envelope  of  HIV-1  for 

predicted  secondary  structure  and  antigenicity. 

Most  patients  with  acquired  immunodeficiency  syndrome  (AIDS)  or  AIDS-related  complex 
show  specific  antibodies  directed  against  proteins  of  human  immunodeficiency  virus  (HIV), 
which  have  virus  neutralizing  activity  (10,133,159)  and  are  supposed  to  be  directed  against 
antigenic  determinants  located  on  the  surface  glycoprotein,  as  has  been  shown  for  other 
enveloped  virus  particles.  However,  the  virus  seems  to  have  adopted  properties  which  allow 
evasion  of  the  immune  surveillance  mechanisms  of  the  host.  Differences  among  various 
isolates  of  HIV  have  primarily  been  analyzed  at  the  nucleotide  sequence  level  of  independent 
isolates  and  also  in  sequential  viral  isolates  from  the  same  patient.  These  variations  seem 
to  be  concentrated  in  the  envelope  protein-encoding  region  of  HIV  and  may  be  fundamentally 
important  for  the  biology  and  pathogenicity  of  HIV.  For  this  reason,  and  for  the  development 
of  viral  antigens  for  diagnostic  or  vaccine  use,  it  is  important  to  identify  and  characterize 
antigenic  determinants  located  in  the  glycoprotein  complex  of  HIV  and  to  define  their  possible 
functions. 

We  analyzed  the  amino  acid  sequences  of  the  envelope  protein  complexes  derived  from 
the  nucleotide  sequences  of  seven  AIDS  virus  isolates,  three  of  which  represented  sequential 
isolates  from  the  same  patient.  This  work  was  an  extension  of  the  work  described  above  in 
sections  1  and  2  on  the  genetic  variability  of  the  HIV  envelope  protein  complex  which  mainly 
focused  on  the  DNA  and  primary  amino  acid  sequences.  By  computer  analysis  we  predicted 
the  secondary  structure  of  gp120  and  gp41,  the  cleavage  products  of  gp160,  and  predicted 
potential  antigenic  sites  by  superimposing  this  secondary  structure  with  the  values  for 
hydrophilicity,  flexibility,  surface  probability,  and  glycosylation.  Eleven  potential  antigenic  sites 
were  identified,  9  of  which  were  located  in  the  exterior  part  (gp120)  of  the  envelope  protein 
and  2  in  the  membrane-bound  portion  (gp41).  Five  highly  variable  regions  were  characterized, 
all  contained  in  gp120  coinciding  with  the  predicted  epitopes.  In  sequential  isolates  from  a 
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single  patient,  all  alterations  of  secondary  structures  occurred  in  those  regions  which  were 
identified  as  probable  antigenic  epitopes. 

The  open  reading  frames  of  the  envelope  gene  derived  from  the  nucleotide  sequence 
of  the  seven  virus  isolates  are  similar  in  size  and  encode  854  (WMJ3)  to  873  (HAT3)  amino 
acids  (Figure  13).  Methionine  codons  at  positions  8  (BH10  and  LAV1A)  or  9  (HAT3,  ARV2, 
WMJ1 ,  WMJ2,  and  WMJ3)  mark  the  beginning  of  the  envelope  protein  with  a  potential  leader 
sequence,  which  is  cleaved  from  the  envelope  precursor  protein  during  maturation.  The 
peptide  sequences  from  position  38  represent  the  envelope  precursor  gp160,  which  is  cleaved 
into  the  exterior  gp120  (with  23  to  25  potential  N-glycosylation  sites)  and  the  membrane-bound 
gp41  (containing  4  to  7  potential  N-glycosylation  sites).  A  stretch  of  positively  charged  amino 
acids  at  positions  512,  509,  508  (WMJ1,  WMJ2,  WMJ3),  517  (ARV2),  518  (BH10),  523 
(LAV1A),  and  275  (HAT3),  respectively,  marks  the  cleavage  site  (5,42). 

Although  the  overall  sizes  and  structures  of  the  seven  surface  proteins  are  rather 
similar,  the  deduced  amino  acid  sequences  differ  substantially.  On  the  average  only  66%  of 
the  amino  acids  are  conserved  in  the  exterior  part  of  the  protein,  and  these  changes  are 
clustered  in  special  regions  with  only  up  to  10%  conserved  amino  acid.  gp41,  the 
transmembrane  part  of  the  envelope  protein  complex,  shows  more  than  80%  conserved  amino 
acids  (Figure  13)  and  no  regions  of  high  variability.  Furthermore,  the  changes  in  the  latter 
region  are  all  due  to  point  mutations,  whereas  changes  in  gp120  frequently  result  from 
insertions  and  deletions  and  appear  as  clustered  mutations  interspersed  with  segments  which 
have  a  high  content  of  conserved  amino  acids  (89%). 

According  to  their  content  of  conserved  amino  acids,  glycosylation  sites,  and  (3-turn 
regions,  the  envelope  proteins  of  HIV  were  subdivided  into  highly  variable  and  constant 
regions.  Constant  regions  were  defined  to  contain  75%  or  more  conserved  amino  acids  and 
to  have  no  amino  acid  insertions  and  deletions.  In  contrast,  variable  regions  showed  a  low 
degree  of  conserved  residues  (25%  or  less)  and  a  great  variability  in  length  due  to  deletions 
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or  insertions;  these  changes  occurred  at  least  every  fifth  amino  acid. 

Due  to  the  high  number  of  conserved  amino  acid  residues  in  the  constant  regions, 
glycosylation  sites  and  secondary  structures  also  showed  a  low  degree  of  variation;  more  than 
50%  of  potential  glycosylation  sites  and  amino  acids  in  p-turn  configurations  were  found  to  be 
conserved.  Regions  of  high  variability  had  a  low  degree  of  conserved  p-turns  (0  to  25%), 
especially  in  a  hydrophilic  environment,  and  almost  no  conserved  potential  glycosylation  sites 
(0  to  25%).  By  these  criteria,  surface  glycoprotein  gpl  60  could  be  subdivided  into  a  clustered 
pattern  of  highly  variable  (VI  to  V5)  and  constant  regions  (Cl  to  C6).  For  purposes  of 
classification  the  minimum  length  of  a  region  was  set  at  10  amino  acid  residues;  the  locations 
and  parameters  of  those  regions  are  shown  in  Figures  13-16. 

For  further  analysis  we  predicted  antigenic  determinants  in  the  amino  acid  sequences 
with  a  computer  program  which  predicted  the  secondary  structure  and  calculated  the  values 
for  hydrophilicity,  flexibility,  and  suiface  probability  (Figures  14  and  15).  These  regions  are 
mainly  located  in  p-turn  regions  which  show  a  high  degree  of  nonhydrophobic  of  flexible  amino 
acid  sequences  or  are  predicted  to  have  a  high  probability  of  location  at  the  surface  of  the 
polypeptide  (32,46,47,62,121). 

in  gpl  20,  nine  epitopes  (I  to  IX)  with  a  high  antigenic  potential  can  be  predicted  (Figure 
16).  Epitope  I  is  located  in  a  region  of  high  variability  (VI)  at  amino  acids  137  to  154  of  viral 
strain  WMJ1 .  The  locations  of  the  antigenic  epitopes  for  the  other  isolates  are  indicated  in 
Table  2. 

Epitope  I  has  elevated  values  for  hydrophilicity  and  flexibility,  1  to  3  potential 
glycosylation  sites,  and  a  number  (up  to  11  in  LAV-la)  of  amino  acids  in  p  turn  configurations. 
Isolates  WMJ1  and  WMJ3  show  no  p  turns  due  to  the  high  variability  of  this  region.  Epitope 
II  (amino  acids  186  to  203)  is  situated  in  variable  region  V2  and  contain  s  residues  with  high 
values  for  hydrophilicity,  flexibility,  and  surface  probability;  up  to  15  amino  acids  (H.-’  T-3)  have 
p  turn  configurations  (with  the  exception  of  ARV2),  and  all  strains  may  be  glycosylated. 
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Epitope  Ml  (amino  acids  232  to  246)  in  constant  region  C2  shows  two  conserved  hydrophilic 
p  turns,  two  conserved  potential  glycosylation  sites,  and  high  values  of  flexibility.  Epitope  IV 
(Amino  acids  300  to  320)  in  variable  region  V3  shows  2  to  8  amino  acids  in  p  turns,  with 
elevated  values  for  hydrophilicity,  flexibility,  and  surface  p-- ^ability,  and  has  1  to  2  potential 
glycosylation  sites.  Epitope  V  (amino  acids  358  to  375)  is  only  slightly  variable  (72% 
conserved  amino  acids),  has  high  values  for  hydrophilicity,  surface  probability  and  flexibility, 
and  has  1  to  2  N-glycosylation  sites  and  6  to  8  residues  in  p  turn  configurations.  Epitope  VI 
(amino  acids  394  to  412),  in  contrast,  is  in  highly  variable  region  V4,  with  2  to  4  possible 
glycosylation  sites,  4  to  1 1  p  turns,  and  high  values  for  hydrophilicity  and  flexibility;  the  values 
for  surface  probability  are  not  elevated.  Epitope  VII  (amino  acids  445  to  458)  in  constant 
region  C3  has  4  to  7  p  turns.  One  glycosylation  site  is  hydrophilic  and  flexible  and  is  directly 
followed  by  epitope  VIII  (amino  acids  459  to  469),  which  shows  high  variability  (V%),  is 
hydrophilic,  flexible,  and  possibly  glycosylated,  and  has  one  (ARV2)  to  six  residues  in  p  turn 
configurations.  Epitope  IX  (amino  acids  470-483)  in  region  C4  has  high  values  for  ail 
parameters,  four  conserved  p  turns,  and  no  potential  glycosylation  sites. 

Only  two  antigenic  sites  could  be  identified  in  gp41.  These  contain  amino  acids  612 
to  613  and  722  to  745  (Figures  13-16).  The  first  epitope  X,  is  located  in  a  slightly  variable 
region  (56%  conserved  amino  acids)  and  contains  four  conserved  glycosylation  sites,  has  high 
values  for  hydrophilicity,  surface  probability,  and  flexibility,  has  4  to  6  p  turns,  and  is  probably 
the  only  antigenic  site  in  gp41  which  is  located  outside  the  lipid  bilayer  and  accessible  to 
antibody  reaction  and  recognition.  The  second  epitope  (amino  acids  722  to  745)  is  located 
directly  after  a  stretch  of  hydrophobic  amino  acids  which  is  likely  to  be  transmembrane  region 
of  gp41  (TM3);  these  hydrophilic,  flexible  amino  acids  in  p  turn  regions  with  high  surface 
probability  probably  represent  the  hydrophilic  anchor  sequence  which  has  been  identified  in 
most  membrane  proteins.  This  region,  however,  should  be  inside  the  cell  and  thus  is  not  the 
best  epitope  for  antigenic  response.  The  following  stretch  of  about  100  amino  acids  has  a 
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further  region  which  might  be  an  antigenic  determinant.  This  region,  whose  function  is  not 
known,  might  however,  be  gradually  cleaved  off  by  proteases  from  the  precursor  before 
maturation  of  gp4l.  With  antibodies  against  a  synthetic  peptide  derived  from  this  region, 
mainly  the  precursor  gp160  could  be  identified;  furthermore,  the  molecular  size  of  a 
glycosylated  gp41  should  be  about  52,000  to  54,000  daltons  by  calculation  (42,000  daltons  of 
the  primary  product  plus  the  molecular  size  of  the  carbohydrate  moiety)  unless  it  is 
proteolytically  processed. 

The  envelope  proteins  of  the  sequential  viral  isolates  from  the  same  patient  which  were 
taken  at  intervals  of  3  and  4  months  are  very  similar  in  size  and  show  only  three  amino  acid 
deletions  or  insertions.  Most  changes  are  due  to  point  mutations.  Most  of  the  alterations  in 
the  amino  acid  sequences  which  are  due  to  those  mutations  are  located  in  those  regions 
which  were  found  to  be  highly  variable.  Differences  in  the  secondary  structures  (Figure  13,14) 
were  identified  in  regions  VI,  V2,  V3,  and  V4,  corresponding  with  antigenic  epitopes  I,  II,  IV, 
and  VI,  and  in  epitopes  V  and  X  which  are  located  in  slightly  variable  regions  (76  and  56% 
of  conserved  amino  acids,  respectively).  A  further  (3  turn  alteration  occurs  in  region  C5  in  a 
short  hydrophilic  environment  which  may  be  located  between  two  transmembrane-spanning 
regions.  That  means  that  ail  alterations  concerning  the  secondary  structure  of  the  envelope 
protein,  with  the  exception  of  that  in  region  C5,  are  located  in  the  predicted  antigenic 
determinants;  epitopes  III,  VII,  VIII,  IX,  and  XI  are  conserved.  Interestingly,  some  of  the  (3  turn 
alterations  identified  in  isolates  WMJ1  and  WMJ2  are  reversed  in  isolate  WMJ3  (I,  II,  IV,  and 
VI),  even  if  there  are  further  variations  in  the  amino  acid  sequence  (I  and  IV). 

After  cleavage  of  the  leader  peptide  from  precursor  protein  gpl  60,  gp120  represents 
the  highly  glycosylated  exterior  part  of  the  envelope  protein  complex  of  HIV.  This  polypeptide 
part  is  likely  connected  only  via  the  ionic  interactions  of  14  positively  charged  amino  acids  in 
the  carboxy-terminal  region  of  gpl 20  (region  C4)  adjacent  to  the  cleavage  site  with  the 
negativity  charged  phosphate  groups  in  the  bilayer  membrane.  There  are  18  conserved 
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cysteine  residues  dispersed  over  the  gp120  sequence  (Figure  13)  which  might  also  be  involved 
in  the  complex  formation  of  gp120  and  gp41.  A  report  describing  the  frequent  loss  of  gp120 
from  the  particles  during  purification  and  immunoelectron  microscopy  argues,  however,  against 
a  direct  involvement  of  disulfite  bonds  in  the  complex  formation  of  gp120  and  gp41. 

The  transmembrane  polypeptide  gp41  contains  three  stretches  of  hydorphobic  amino 
acids.  Directly  after  the  cleavage  site  there  are  about  60  hydorphobic  residues  (amino  acids 
511  to  571)  which  are  interrupted  by  a  stretch  of  about  10  amino  acids  that  are  predicted  to 
have  a  high  surface  probability  and,  in  comparison  with  the  surrounding  residues,  a  higher 
hydrophilicity  and  flexibility  and  some  alterations  in  p  turns.  From  the  patterns  of  other  viral 
(121)  or  membrane  proteins,  e.g.,  bacteriorhodopsin,  acetylcholine  receptor,  and  erythrocyte 
band  III,  such  regions  are  known  to  represent  sequential  transmembrane-spanning  regions. 
A  domain  of  hydrophilic,  flexible,  and  charged  amino  acids  (amino  acids  to  572  to  594)  linked 
to  the  hydrophobic  region  may  represent  a  hydrophilic  anchor  sequence. 

A  transmembrane  region,  which  was  suggested  from  amino  acids  670  to  695  with  its 
hydrophilic  anchor  has  already  been  mentioned.  In  transmembrane  regions  TM2  and  TM3 
three  charged  amino  acids  (Arginine  at  positions  557  and  707  and  glutamic  acid  at  position 
560)  are  identifiable.  Those  charged  amino  acids  were  also  reported  in  other  transmembrane 
regions  in  multi-spanning  proteins  and  do  not  lead  to  disturbance  of  the  overall  hydrophobic 
character  of  a  transmembrane  region.  Between  TM2  and  TM3  hydrophobic  regions  from  the 
N  terminus  of  gp41  are  90  amino  acid  residues  (580  to  670)  which  should  be  facing  the  outer 
side  of  the  lipid  bilayer.  Within  this  region  are  four  conserved  glycosylation  sites  and  a  good 
antigenic  probability  (epitope  X).  Alternatively,  TM1  and  TM2  may  represent  an  apolar 
hydrophobic  stretch  of  amino  acids  located  outside  the  viral  membrane  and  may  play  a  role 
similar  to  the  paramyxovirus  fusion  protein  by  penetrating  into  the  membrane  of  the  fusion 
partner.  The  same  could  occur  if  the  C-terminal  part  of  gp41 ,  including  or  not  including  TM3, 
would  be  directed  to  the  outer  membrane  or  be  in  equilibrium  between  those  two  orientations. 
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This  possibility  is  supported  by  the  observation  that  antiserum  gains  a  peptide  from  amino 
acids  725  to  752  gives  cell  surface  labeling  and  is  neutralizing.  The  predicted  antigenic 
determinants  are  not  affected  by  the  discussed  models. 

4.  To  determine  the  molecular  and  biologic  properties  of  HIV-2  isolates  from 

individuals  in  West  Africa  where  HIV-2  appears  to  be  less  virulent  compared  with 

HIV-1  and  some  other  strains  of  HIV-2. 

Serological  studies  first  suggested  that  a  new  type  of  HIV-1  related  virus  might  be 
present  in  West  African  human  populations  (87).  It  was  suggested  that  this  virus,  then  termed 
"HTLV-4"  might  be  closely  related  to  a  similar  virus  isolated  from  African  green  monkeys,  then 
terms  "STLV-3"  (87,88).  Because  of  similarities  in  structure  and  antigenicity  among  HIV-1, 
HIV-2  and  STLV-3  (also  referred  to  as  SIV,  simian  immunodeficiency  virus),  some  relation 
might  be  expected  between  their  nucleic  acid  sequences  and  phylogenesis.  Restriction 
enzyme  analysis  showed  that  six  out  of  six  STLV-3AGM  isolates  and  two  out  of  three  HTLV-4 
isolates  were  identical  in  32  out  of  32  restriction  sites  mapped.  The  third  HTLV-4  isolate  was 
identical  to  the  others  in  31  out  of  32  mapped  restriction  sites.  Of  three  STLV-3MAC  isolates 
obtained  independently  at  the  New  England  Regional  Primate  Center,  one  (isolate  251)  was 
identical  to  the  STLV-3AGM/HTLV-4  consensus  pattern  in  23  out  of  23  sites,  and  the  others  were 
identical  to  each  other  and  similar  to  the  consensus  pattern  in  21  out  of  23  sites.  These 
findings  raised  the  suspicion  that  HTLV-4  and  STLV-3  reported  to  have  been  isolated  from 
West  African  people  and  monkeys  could  represent  laboratory  containments  with  SIVMAC  isolate 
from  macaque  monkeys  housed  in  the  NERPC  (39). 

To  define  more  precisely  the  relation  between  different  HTLV-4,  STLV-3,  HIV-2  and 
HIV-1  viruses,  we  performed  a  combination  of  restriction  endonuclease  cleavage  mapping, 
molecular  cloning,  and  nucleotide  sequencing  to  analyze  and  compare  different  isolates  of 
HTLV-4  and  STLV-3.  The  DNA  from  two  HTLV-4  isolates  (HTLV-4/PK82,  HTLV-4/PK289)  and 
two  STLV-3  isolates  (STLV-3AGM/K1 ,  STLV-3AQM/Mm  6324)  was  extracted  from  virally-infected 


28 


cell-pellets  without  interim  cell  cultivation.  Two  other  virus  isolates  (HTLV-4/PK190, 
STLV-3AGM/K1 )  were  cultured  and  used  to  prepare  RNA  for  a  viral-specific  partial 
complementary  DNA  clone.  A  viral-specific  insert  from  one  of  these  clones  (HTLV-4/PK190, 
clone  pV2)  was  used  to  probe  recombinant  lambda  phage  libraries  prepared  from  DNA  infected 
with  HTLV-4/PK82  and  HTLV-4/PK289.  Full-length  proviral  DNA  clones  including  flanking 
cellular  sequences,  were  obtained  for  both  HTLV-4  viruses.  The  restriction  enzyme  cleavage 
patterns  of  STLV-3AGM/K1 ,  STLV-3AGM/Mm6324,  HTLV-4/PK82,  HTLV-4/PK289  and  of  the  full- 
length  lambda  phage  clones  derived  from  the  two  HTLV-4  isolates  were  compared.  Eight 
restriction  enzymes  (Ndel,  Pstl,  Xbal,  BamHI,  Hind  III,  Sstl,  Bgl  II,  EcoRI)  were  used.  The 
restriction  cleavage  patterns  of  these  isolates  and  proviral  clones  were  identical  in  19  out  of 
IS  sites  mapped,  and  corresponded  to  the  HTLV-4/STLV-3AGM  consensus  map. 

The  DNA  from  proviral  or  cDNA  clones  of  HTLV-4/PK82,  HTLV-4/PK190  and 
STLV-3agm/K1  was  sequenced  to  define  their  relation  to  each  other  and  to  HIV-2  and  other 
STLV-3  isolates.  For  HTLV-4/PK82,  the  regions  sequenced  included  the  entire  long  terminal 
repeat  (LTR),  gag  pi  6  equivalent,  central  region,  envelope,  and  3'  open  reading  frame  (5,391 
base  paris).  For  HTLV-4/PK190  and  STLV-3AGM/K1 ,  the  sequence  included  R  and  U3  LTR 
regions  and  portions  of  3'  orf  (850  and  1,160  base  paris,  respectively).  A  summary  of  this 
nucleotide  sequence  information  is  shown  in  Figure  17  and  has  been  submitted  to 
EMBL/GenBank  under  accession  code  number  Y00283. 

From  this  comparison  of  the  sequences  of  HTLV-4/PK190  and  three  different  STLV-3AGM 
(K78,  K6W,  K1),  none  differ  from  HTLV-4/PK82  by  more  than  1.1%.  In  the  envelope  gene, 
which  varies  most  widely  in  independent  isolates  of  HIV-1,  the  three  STLV-3AGM  sequences  vary 
from  HTLV-4/PK82  by  0.8%  or  less.  But  the  virus  STLV-3mac/M ml 42-83  differs  from  both 
HTLV-4/PK82  and  STLV-3AGM/K6W  by  about  4.7%  in  envelope.  Figure  18  depicts  the 
distribution  of  deduced  amino  acid  changes  present  in  the  envelopes  of  different  STLV-3AGM, 
STLV-3mac,  and  HIV-2  viruses  with  respect  to  HTLV-4/PK82.  Compared  to  HTLV-4/PK82,  there 
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were  10  amino  acid  differences  in  STLV-3AGM/K6W,  14  in  STLV-3AGM/K78,  and  65  in  STLV- 
3M Ac/Mm 142-83.  All  three  STLV-3  and  both  HTLV-4  sequences  (Figures  17  and  18)  contain 
an  identical  stop  codon  in  the  envelope  transmembrane  coding  region.  Also, 
STLV-3mac/M ml 42-83  compared  to  HTLV-4/PK82  contains  clustered  nucleotide  and  amino  acid 
substitutions  and  a  three-base-pair  (bp)  insertion.  In  the  same  regions  of  the  viral  envelope, 
HTLV-4  and  STLV-3AGM  are  identical.  As  a  result  of  the  nearly  identical  restriction  patterns  and 
nucleotide  sequences  of  three  different  HTLV-4  and  six  different  STLV-3AGM  isolates,  and 
because  of  the  failure  to  re-isolate  such  STLV-3-like  viruses  from  humans,  and  in  view  the  own 
isolation  of  a  distinct  HIV-2-related  virus  from  a  healthy  Senegalese  woman,  we  concluded  that 
HTLV-4  and  STLV-3AGM  are  not  independent  laboratory  isolates,  but  instead  represent 
transmission  of  a  single  laboratory  virus  strain  to  multiple  cultures.  Furthermore,  the  various 
HTLV-4  and  STLV-3AGM  isolates  probably  originated  from  cultures  of  STLV-3MAC  isolate  251, 
because  the  HTLV-1 4/STL V-3AQM  restriction  site  consensus  pattern  coincides  precisely  with 
isolate  251  in  32  out  of  32  mapped  sites,  but  is  similar  to  STLV-3MAC  isolate  142  in  only  27 
out  of  31  sites. 

Despite  showing  that  HTLV-4  was  not  a  bone  fide  virus  isolate,  it  was  clear  from 
clinical  and  serological  data  that  West  African  human  populations  were  infected  with  an  HIV 
related  virus  that  was  more  similar  to  SIV  than  to  HIV-1.  We  thus  attempted  to  isolate 
retroviruses  from  the  peripheral  blood  of  four  healthy  seropositive  Senegalese  prostitutes. 
Mononuclear  cell  preparations  were  cocultivated  with  uninfected  normal  donor  lymphocytes. 
After  1-2  weeks,  transient  reverse  transcriptase  activity  was  detected  in  culture  supernatants 
from  three  individuals,  and  virus  from  one  of  these  cultures  (isolate  ST)  was  transmitted 
successfully  to  four  immortalized  T-cell  lines,  SUPT1,  HUT78,  H9,  and  CEM174.  On  the  basis 
of  its  ultrastructural  morphology,  particulate  reverse  transcriptase  activity,  T4  cell  tropism  and 
antigenic  reactivity  by  Western  blot  hybridization  analysis,  this  virus  was  related  to  both  HIV-2 
and  STLV-3  (SIV).  Southern  blots  of  virally-infected  cellular  DNA  from  these  ST  cultures  using 
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DNA  probes  specific  for  HIV-2  and  for  HTLV-4/STLV-3,  however,  clearly  showed  the  ST 
isolate  to  be  more  related  to  HIV-2  than  to  STLV-3/HTLV-4,  as  shown  in  Figure  19.  The 
proviral  DNA  of  the  Senegalese  isolate  hybridized  preferentially  to  HIV-2  and  the  restriction 
pattern  of  ST  was  unique.  Moreover,  two  related  yet  distinguishable  viral  genotypes  were 
found  in  the  ST  isolate,  reflecting  a  genotypic  heterogeneity  STLV-3  (SIV)  virus  was  not 
detected  in  any  of  the  four  Senegal  cultures.  These  data  demonstrate  the  existence  of  HIV-2 
in  Senegal  and  suggest  that  West  African  individuals  seropositive  for  HTLV-4  are  indeed 
infected  with  a  retrovirus  but  that  this  belongs  to  the  HIV-2,  not  STLV-3  (SIV),  group. 

Although  human  immunodeficiency  virus  type  1  (HIV-1)  is  the  etiologic  agent  of 
epidemic  AIDS  in  Central  Africa,  Europe,  the  United  States,  and  most  countries  worldwide, 
many  cases  of  AIDS  in  West  Africa  have  been  attributed  to  the  related  retrovirus,  human 
immunodeficiency  virus  type  2  (HIV-2)  (26-28).  Although  it  is  clear  that  HIV-2  in  many 
instances  causes  fatal  immunodeficiency  (26-28),  recent  studies  (41,86,107)  have  suggested 
that  many  West  African  individuals  as  a  group,  when  infected  with  HIV-2,  may  have  less 
severe  immunodeficiency  than  individuals  infected  with  HIV-1.  These  studies  involved 
widespread  serologic  testing  of  West  African  populations  by  immunoblot  and 
radioimmunoprecipitation  (RIP)  assays  with  viral-specific  target  antigens  derived  from  cultures 
of  HTLV-4/PK82  (87),  a  virus  strain  shown  above  and  elsewhere  (76,91,94)  to  represent  a 
simian  retrovirus  termed  SIVMAC  (simian  immunodeficiency  virus  of  macaques).  In  our  studies 
described  above,  we  characterized  an  HIV-2  related  virus  (HIV-2/ST)  from  a  healthy  prostitute 
in  Senegal,  West  Africa  which  represented  the  first  independent  virus  isolation  from  this  region. 

The  biologic  and  genetic  properties  of  this  virus  distinguish  it  from  other  known 
immunodeficiency-associated  retroviruses,  and  raise  the  possibility  that  such  strains  of  HIV-2 
may  be  responsible  for  altered  disease  presentations  in  naturally  infected  human  populations. 

HIV-2/ST  was  isolated  from  peripheral  blood  mononuclear  cells  (PBMC)  of  patient  ST. 
Virus  was  transmitted  to  four  immortalized  T-cell  lines  (Hut78,  H9,  SupTI ,  and  CEMx174)  using 
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repeated  PEG  (polyethylene  glycol)  precipitations  of  primary  culture  supernatants  in  an  attempt 
to  isolate  and  amplify  viruses  with  potentially  attenuated  virulence.  The  DNA  genome  of 
HIV-2/ST  in  these  cultures  hybridized  at  low  and  high  stringency  to  the  prototype  HIV-2/ROD 
probe  (Figure  20)  whereas  SIVMAC  DNA  hybridized  to  the  HIV-2/ROD  probe  only  under 
conditions  of  low  stringency.  Conversely,  HIV-2/ROD  and  HIV-2/ST  DNA  hybridized  with  equal 
intensity  to  SIVMAC  probe,  but  only  at  low  stringency,  and  a  full-length  HTLV-1  probe  gave  no 
hybridization.  Two  different  HIV-2/ST  producing  cell  lines  derived  from  the  transmission  of  cell- 
free  virus  from  primary  ST  lymphocyte  co-cultures  exhibited  very  similar  proviral  restriction 
enzyme  cleavage  patterns  but  were  distinguishable  by  polymorphic  Xba  I  and  BamHI  sites 
(Figure  20).  A  recombinant  DNA  lambda  phage  library  prepared  from  one  of  these  cell  lines, 
Hut  78/BI 2,  which  represented  a  biologically  cloned  high  producer  line,  yielded  three  full- 
length  proviral  DNA  clones  that  were  evaluated  by  detailed  restriction  analyses.  Comparative 
restriction  mapping  confirmed  the  genetic  similarity  between  HIV-2/ST  and  HIV-2/ROD  (10  of 
25  restriction  sites  in  common)  and  the  dissimilarity  between  HIV-2/ST  and  SIVMAC  (2  of  36 
restriction  sites  in  common)  (Figure  20).  Furthermore,  one  of  these  proviral  DNA  clones 
(lambda  JSP4-27),  when  transfected  into  T-lymphocyte  cell  lines,  produced  virus  that  was 
replication-competent  and  infectious  and  whose  biological  properties  were  identical  to  the 
parental  virus  isolate.  Thus,  HIV-2/ST  is  genotypically  more  related  to  HIV-2/ROD  than  to 
other  primate  viruses,  and  within  the  HIV-2/ST  isolate  there  exist  polymorphic  genotypes  some 
of  which  are  fully  replication  competent. 

The  structural,  antigenic,  and  functional  characteristics  of  HIV-2/ST  proteins  were 
evaluated  by  immunoblot  and  RIP  analyses  (Figure  21).  The  putative  major  gag  structural 
protein  and  precursor  (p26/pr55)  of  HIV-2/ST  were  comparable  in  size  to  p26/pr55  of 
HIV-2/ROD,  p27/pr55  of  SIVMAC,  and  p24/pr53  of  HIV-1.  Polymerase  and  endonuclease 
proteins  of  HIV-2/ST  were  tentatively  identified  as  p64  and  p34  proteins  with  equivalent 
counterparts  in  HIV-2/ROD,  SIVMAC,  and  HIV-1.  The  env  proteins  of  HIV-2/ST  revealed  a  140 
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kD  putative  extracellular  protein  similar  to  that  in  HIV-2/ROD.  However,  in  different  subcultures 
of  HIV-2/ST  that  were  derived  by  single  cell  cloning  of  the  original  SupTI/LKOOl  cell  line,  the 
putative  transmembrane  proteins  (TMP)  were  either  42  kD  or  30  KD  in  size.  Only  smaller- 
sized  transmembrane  proteins  were  seen  in  HIV-2/ROD  (36  kD)  and  SIVMAC  (32  kD)  cultures. 
The  HIV-2/ST  viruses  with  large  and  small  transmembrane  proteins  could  be  transmitted  cell 
free,  were  replication  competent,  and  retained  the  same  size  of  TMP  throughout  passage  in 
the  same  cell  targets  (SupTI  and  CEMx174).  This  suggests  that  the  observed  differences  in 
TMP  sizes  were  due  to  nucleotide  sequence  differences  in  the  expressed  mRNAs,  not  to  post- 
transcriptional  modifications  such  as  altered  glycosylation  patterns.  Consistent  with  this 
suggestion  is  the  finding  of  translational  stop  codons  in  HIV-2/ROD  and  SIVMAC  viral  envelope 
genes  which  result  in  the  expression  of  truncated  TMPs  (28).  Antigenically,  HIV-2/ST  and 
HIV-2/ROD  were  similar  to  each  other  and  to  SIVMAC  in  their  reactivity  with  HIV-2  antisera  and 
were  cross- reactive  with  HIV-1  only  in  the  major  gag  structural  proteins  (p24-p27).  This  was 
confirmed  by  bi-directional  immunoblotting  in  which  both  HIV-1  and  HIV-2  specific  human 
antisera  were  used  to  probe  HIV-1,  HIV-2/ROD,  and  HIV-2/ST  proteins  (Figure  21).  By  RIP, 
we  found  env  precursor  proteins  of  approximately  180  or  170  kD  in  the  different  HIV-2/ST 
subcultures  (reflecting  the  different  sizes  of  their  respective  transmembrane  proteins)  and 
mature  extracellular  envelope  proteins  of  140  kD  (Figure  21).  By  RIP,  we  also  found  that,  like 
HIV-1,  the  HIV-2/ST  external  env  glycoprotein  bound  directly  to  an  epitope  on  CD4  recognized 
by  OKT4A  (Figure  2C),  but  not  by  OKT4  (112).  Other  experiments  with  fluorescence  activated 
cell  sorting  analysis  showed  that  infection  of  CD4  bearing  cells  by  HIV-2/ST  virus  down- 
modulated  the  expression  of  CD4(T4)  but  not  CD8(T8),  again  analogous  with  HIV-1  infection 
(111). 

Because  of  the  evidence  suggesting  that  certain  West  African  human  populations 
infected  with  HIV-2  related  viruses  may  have  less  severe  immunodeficiency  than  do  individuals 
infected  with  HIV-1,  or  some  isolates  of  HIV-2  including  HIV-2/ROD  (41,86,107),  we  evaluated 


33 


HIV-2/ST  virus  for  cytopathic  and  cell  killing  properties  in  vitro.  We  first  studied  the  induction 
of  syncytia  by  HIV-2/ST  infected  cells  co-cultured  with  a  panel  of  different  CD4+  cells.  As 
shown  in  Figure  22,  both  HIV-1/lllb  and  HIV-2/ROD,  but  not  HIV-2/ST,  produced  large  syncytia 
when  virally-infected  cells  were  co-cultured  with  uninfected  CD4*  cells.  Hela-T4,  H9,  and  Hut 
78  indicator  cells  gave  identical  results.  Syncytia-induction  was  inhibited  by  the  monoclonal 
antibody  Leu3a  and  no  syncytia  were  formed  by  any  HIV-1  or  HIV-2  infected  cells  when 
co-cultured  with  CD4' cells.  Cytopathicity  was  also  assessed  by  direct  determination  of  cell 
killing.  HIV-2/ROD  and  two  different  isolates  of  HIV-1  (lllb  and  BC)  caused  marked  cell  killing 
of  SupTI  cells,  PHA-stimulated  normal  donor  lymphocytes  (Fig.  4,  C  and  D),  as  well  as  Hut78 
and  H9  cells  (Figure  23).  Eciuralent  amounts  of  HIV-2/ST  showed  no  detectable  cell  killing 
activity  against  the  same  cell  targets.  Even  10-  and  100-fold  more  HIV-2/ST  virus  than  three 
other  HIV-1  isolates  (WMJ,  RH,  BC)  led  to  only  transient  depression  in  cell  counts.  These 
cell-killing  experiments  were  repeated  three  times  with  identical  results,  each  time  with  the  use 
of  newly  concentrated  virus  stocks.  Syncytia  induction  that  accompanied  virus  infection  in 
these  experiments  was  also  scored  and  was  dramatic  in  size  and  number  in  cultures  infected 
by  HIV-2/ROD  and  the  four  HIV-1  isolates  but  absent  in  cultures  infected  with  HIV-2/ST.  The 
virus  stocks  used  for  these  experiments  were  obtained  from  the  original  uncloned  HIV-2/ST 
infected  line  SupTI  /LK001  and  were  infectious  and  replication-competent  on  the  basis  of  their 
ability  to  infect,  replicate  in,  and  reinfect  by  cell-free  passage  CD4T  cells.  Furthermore,  the  cell 
targets  shown  in  Figure  23,  and  the  virus  infected  cells  used  for  the  syncytia  induction  assays 
in  Figure  22,  were  comparably  infected  with  HIV-2/ST,  HIV-2/ROD,  and  HIV-1  based  on  viral- 
specific  immunofluorescence,  in  situ  hybridization,  and  supernatant  particulate  reverse 
transcriptase  assays. 

In  previous  studies  of  the  infectivity  and  fusion  activity  of  SIVMAC,  it  was  observed  that 
a  cell  line  (CEMx174)  representing  a  somatic  cell  hybrid  between  a  T-cell  (CEM)  and  a  B-cell 
(B721.174),  but  neither  parental  cell  line  itself,  was  exquisitely  sensitive  to  virus  infection  and 
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syncytia  induction  (82).  We  therefore  used  this  hybrid  cell  line  (CEMx174)  in  cytopathicity 
assays  with  HIV-2/ST.  C£Mx174  cells  were  highly  susceptible  to  HIV-2/ST  infection  and  were 
permissive  for  permanent,  high-titer  virus  production.  Syncytia  induction  and  cell-killing  assays 
showed  a  modest  cytopathic  effect  but  again  much  less  than  for  prototype  strains  of  HIV-1  and 
HIV-2.  To  evaluate  the  basis  for  the  attenuation  of  HIV-2/ST,  we  used  in  situ  hybridization 
to  analyze  at  the  single  cell  level  the  rate  of  virus  entry  and  early  gene  expression  after  cell- 
free  viral  transmission  (Figure  24).  Compared  to  prototype  strains  of  HIV-1  and  HIV-2, 
HIV-2/ST  showed  markedly  delayed  onset  of  viral  RNA  production  in  CEMx174  as  well  as  in 
SupTI  and  Hut  78  following  exposure  to  cell  free  virions.  However,  once  infected,  cells 
expressed  equivalent  amounts  of  HIV-2/ST  RNA  as  compared  with  HIV-2/ROD  and  HIV-1/lllb, 
and  produced  equivalent  amounts  of  mature  virus  as  assessed  by  RT  activity  and  viral 
structural  proteins.  This,  and  the  fact  that  HIV-2/ST  is  less  fusogenic,  suggests  that  the  defect 
in  HIV-2/ST  is  at  the  level  of  virus  entry,  not  expression. 

Previous  studies  (3,  4,  12)  have  shown  that  viruses  of  the  HIV-2  group  may  cause 
severe  and  fatal  immunodeficiency.  Viruses  cultured  from  these  diseased  patients  were  readily 
isolated  by  standard  lymphocyte  co-culture  techniques  and  showed  peaks  of  supernatant  RT 
activity  and  cytopathic  effects  within  the  first  2  weeks  of  culture,  similar  to  HIV-1.  We  had 
considerable  difficulty  in  establishing  virus  isolates  from  healthy  Senegalese  subjects.  Out  of 
four  patients  originally  studied,  RT  activity  in  lymphocyte  co-culture  supernatants  was  detected 
in  three,  but  in  only  one  case  was  a  virus  (HIV-2/ST)  successfully  amplified  and  transmitted 
to  permanently  producing  immortalized  T-cell  lines.  Taken  together  with  the  cytopathicity  and 
infectivity  data,  these  results  suggest  that  certain  strains  of  HIV-2  may  be  less  virulent  than 
others  and  in  turn  be  associated  with  a  less  severe  or  altered  expression  of  immune 
deficiency.  The  molecular  basis  for  the  attenuated  virulence  of  HIV-2/ST  in  vitro  is  not  known. 
However,  the  demonstration  that  gp120-T4(CD4)  binding  is  followed  by  down-modulation  of  cell 
surface  T4(CD4)  expression  but  not  cell  fusion,  and  the  apparent  delay  in  cell  entry  and 
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spread  after  cell-free  viral  transmission,  suggest  that  changes  in  the  viral  envelope  important 
for  membrane  fusion  or  other  post-binding  events  are  involved.  We  have  also  isolated  from 
a  single  subject  on  two  occasions  a  strain  of  HIV-1  that,  like  HIV-2/ST,  was  poorly 
transmissible  and  required  repeated  PEG  precipitations  of  culture  supernatants  for  successful 
amplification.  Mullins  and  co-workers  (118)  have  recently  described  a  replication  defective 
strain  of  feline  leukemia  virus  (FeLV)  whose  variant  env/LTR  region  is  actually  responsible 
for  enhanced  virulence  and  they  have  suggested  that  biologically  important  strains  of  human 
immunodeficiency  viruses  could  have  similarly  altered  genetic  and  biologic  properties.  This 
report  of  an  attenuated  form  of  HIV-2  is  an  example  of  such  a  naturally-occurring  HIV  variant. 

The  significance  of  different-sized  env  transmembrane  glycoproteins  in  HIV-2/ST  is 
unknown.  Biologically-cloned  cultures  of  HIV-2/ST  with  either  large  (42  kD)  or  small  (30kD) 
TMPs  were  both  infectious,  replication-competent,  and  non-cytopathic  as  were  isolates  of 
SIVMAC  and  HIV-2/ROD  having  only  small  TMPs.  These  data  suggest  that  a  full-length  (gp  42) 
TMP  is  not  required  for  any  of  these  biological  functions,  although  the  carboxyterminus  of  the 
TMP  reading  frame,  which  is  highly  conserved  among  HIV-1,  HIV-2,  and  SIVMAC,  could 
potentially  encode  a  separate  protein  with  distinct  biologic  function. 

Considerable  controversy  has  surrounded  the  question  as  to  what  types  of  human 
immunodeficiency  viruses,  besides  HIV-1,  are  present  in  West  Africa.  HIV-2/ST  was  shown 
by  differential  nucleic  acid  hybridization,  restriction  mapping,  and  analysis  of  core  proteins  to 
be  considerably  more  related  to  the  prototype  HIV-2/ROD  virus  than  to  HIV-1  or  SIVMAC.  There 
was  no  evidence  in  any  of  the  primary  or  secondary  cultures  of  HIV-2/ST,  or  in  cultures  from 
the  other  three  Senegalese  subjects  similarly  studied,  for  the  presence  of  an  SIVMAC-like  virus. 
Because  of  the  near  identity  among  isolates  of  HTLV-4  and  SIVMAC-251  at  the  genetic  level, 
the  failure  to  reisolate  similarly  conserved  SIVMAC-like  viruses  from  West  Africans,  and  the 
genera!  finding  of  genotypic  diversity  among  independent  isolates  of  HIV-1,  HIV-2,  and  SIVMAC, 
we  conclude  that  the  HIV-2  viruses  typified  by  HIV-2/ROD,  HIV-2/SBL6669,  and  HIV-2/ST, 
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along  with  HIV-1,  represent  the  naturally  occurring  human  immunodeficiency  viruses  in  West 
Africa. 

As  part  of  our  studies  involving  the  genetic  and  biologic  characterization  of  HIV-1  and 
HIV-2  viruses,  we  also  focused  attention  on  unique  features  of  central  region  genes  that 
distinguish  HIV-1  from  HIV-2.  Nucleotide  sequence  comparisons  of  HIV-1,  HIV-2  and  SIVMAC 
demonstrated  an  overall  highly  conserved  genomic  organization,  represented  by  LTR-gag-pol- 
vif-central  region-env-3‘orf-LTR  (3,22,63,114,130,131,139,156).  HIV-2  and  SIVMAC  are 
approximately  70%  homologous  at  the  nucleotide  sequence  level  and  each  is  approximately 
45%  similar  to  HIV-1  (22,63).  Interestingly,  there  is  an  additional  open  reading  frame  in  the 
genomes  of  HIV-2  and  SIVMAC  that  is  missing  in  HIV-1.  This  open  reading  frame,  termed  X, 
is  situated  in  the  central  viral  region  between  vif  and  vpr  genes  and  partially  overlaps  the  vif 
open  reading  frame  on  its  5’  end.  The  deduced  amino  acid  sequence  of  X  predicts  a  protein 
of  112  amino  acids  and  14.5kD  calculated  molecular  weight  and  is  conserved  in  94  of  112 
amino  acid  residues  between  HIV-2  and  SIVMAC  (Figure  25).  In  this  study  we  chemically 
synthesized  HIV-2  and  SIVMAC  X  specific  oligopeptides,  raised  heterologous  antisera  to  these 
peptides  in  rabbits,  and  expressed  the  SIVMAC  X  open  reading  frame  as  TrpE  fusion  proteins 
in  E.  coli.  With  these  reagents,  we  demonstrated  that  X  encodes  a  novel  retroviral  protein 
uniquely  present  in  HIV-2  and  SIVMAC  and  provided  evidence  for  its  expression  and 
immunogenicity  in  vivo. 

Two  synthetic  peptides  encompassing  1 9  residues  from  the  N-terminus  of  the  deduced 
X  amino  acid  sequences  of  HIV-2/ROD  and  SIVMAC/PK82  (underlined  in  Figure  25),  were 
synthesized  and  used  to  generate  rabbit  immune  sera.  The  peptides  were  prepared  by  solid 
phase  methodology  on  an  Applied  Biosystems  430-A  peptide  synthesizer  using  a 
paramethylbenzhydrolamine  resin  (90).  Synthetic  peptides  were  coupled  to  keyhole  limpet 
hemocyanin  (KLH)  at  the  carboxyterminus,  after  the  addition  of  two  glycine  spacers  and  a 
cysteine  residue.  New  Zealand  white  rabbits  were  immunized  with  Img  of  KLH-coupled 
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peptides  emulsified  1 :1  in  Freunds  complete  adjuvant,  boosted  twice  at  two  week  intervals  with 
the  same  peptides  mixed  1:1  with  Freunds  incomplete  adjuvant,  and  bled  six  weeks  after  the 
first  immunization  to  collect  immune  sera.  A  peptide  ELISA  with  non-KLH-coupled 
oligopeptides  as  antigen  was  performed  to  confirm  high  titer  antipeptide  antibodies  in  these 
sera  (47). 

The  SIVMAC  X  orf  was  also  expressed  as  a  fusion  protein  in  E.  coli.  An  Xho-ll/Hindlll 
fragment  containing  the  entire  SIVMAC  X  open  reading  frame  minus  the  first  six  nucleotides 
(SIVMAC  clone  PKE  102,  ref.  26)  was  subcloned  into  the  bacterial  expression  vector  pATH 
(152),  immediately  adjacent  and  in-frame  with  the  bacterial  Trp-E  gene  (pATH-X,  Figure  26A). 
£.  coli  (HB101)  transformed  with  pATH-X  were  treated  in  mid-log  phase  growth  with  Indole 
acrylic  acid  to  induce  the  trp  operon  (116).  Cell  extracts  were  subjected  to  polyacrylamide  gel 
electrophoresis,  and  a  predominant  protein  of  approximately  50kD,  consistent  with  the  predicted 
molecular  weight  of  a  TrpE/X  fusion  protein,  was  identified  by  Coomassie  blue  staining  of 
sodium  dodecyl  sulfate  polyacrylamide  gels.  A  second  plasmid  construct  was  prepared  by 
deleting  an  internal  Nrul/EcoRI  fragment  in  pATH-X,  which  removed  95%  of  the  TrpE  coding 
sequence  (Figure  26A).  This  plasmid  retained  only  16  amino  acid  residues  derived  from  the 
TrpE  gene  on  the  N-terminus  and  resulted  in  an  expressed  X  fusion  protein  of  15kD  (delta 
TrpE/X).  Both  the  TrpE/X  fusion  protein  and  the  TrpE  deleted  X  fusion  protein  comprised 
approximately  20%  of  the  total  cellular  protein  as  judged  by  Coomassie  blue  staining  of  gel 
electrophoresed  bacterial  lysates.  Sera  from  rabbits  immunized  with  the  SIVMAC  and  HIV-2  X 
oligopeptides  (Figure  26B,  panels  B  and  D,  respectively)  reacted  strongly  with  these  lysates 
on  Western  blots  (TrpE/X  lanes  2;  delta  TrpE/X  lanes  3),  but  no  reactivity  was  detectable  with 
the  corresponding  pre-immune  sera  (panels  A  and  C),  or  with  lysates  of  cultures  transformed 
with  the  non-recombinant  vector  (lanes  1).  An  anti-TrpE  antiserum,  used  for  control,  detected 
both  the  37kD  TrpE  protein  alone  as  well  as  the  50kD  TrpE/X  fusion  protein,  but  did  riot 
recognize  the  15kD  TrpE  deleted  X  protein  (panel  E). 


38 


In  order  to  identify  the  putative  X  protein  in  SIVMAC  and  HIV-2  infected  cells  and  cell- 
free  virions,  immunoblotting  techniques  were  used  to  probe  viral  preparations  for  reactivity  with 
the  rabbit  immune  sera.  Figure  27A  shows  the  Western  blot  patterns  of  viral  preparations  from 
cultures  infected  with  HIV-1  (lanes  1),  HIV-2  (lanes  2),  SIVMAC  (lanes  3),  as  well  as  mock- 
infected  cells  (lanes  4).  A  polyclonal  human  anti-HIV-1  serum  (RR)  and  a  polyclonal  human 
anti-HIV-2  serum  (ST)  were  used  for  control  to  confirm  the  presence  of  virus  specific  antigens. 
Probing  with  the  X-specific  rabbit  immune  sera  (anti-SIVMAC,  panel  B;  anti-HIV-2,  panel  D) 
resulted  in  the  identification  of  a  14  and  12kD  viral  protein  in  HIV-2  and  SIVMAC  viral 
preparations,  respectively,  but  not  in  similarly  prepared  preparations  of  HIV-1  or  virally- 
uninfected  control  cultures.  Similar  results  were  obtained  using  whole  cell  lysates  infected  with 
these  viruses.  No  reactivity  was  detected  with  the  preimmune  sera  (panels  A  and  C). 
Interestingly,  sera  from  rabbits  immunized  with  the  HIV-2  X  peptide  recognized  only  the 
homologous  HIV-2  X  protein  (panel  D),  while  the  SIVMAC  X  peptide  antisera  recognized  both 
the  HIV-2  and  the  SIVMAC  X  protein  (panel  B).  The  SIVMAC  X  protein  appeared  to  be  slightly 
smaller  as  compared  to  HIV-2  and  was  reproducibly  present  in  considerably  less  amounts. 
The  significance  of  these  differences  is  presently  under  study. 

In  order  to  determine  whether  the  X  protein  was  actually  virion-associated,  HIV-2/ST 
virus  was  first  concentrated  by  ultracentrifugation  and  then  further  purified  on  a  20%  to  60% 
continuous  sucrose  gradient  prior  to  Western  blot  analysis.  As  shown  in  Figure  27B,  HIV-2 
X  peptide  immune  sera  (D)  detected  the  identical  14kD  X-protein  on  immunoblots  of  banded 
HIV-2  virions.  The  intensity  of  this  X  specific  band  was  equal  to  slightly  greater  as  compared 
to  cell  lysates  infected  with  comparable  amounts  of  HIV-2.  A  human  serum  (ST)  from  a 
healthy  HIV-2  infected  individual,  and  the  preimmune  rabbit  antiserum,  failed  to  detect  the  X 
protein  in  the  same  antigen  preparation.  These  data  were  confirmed  with  a  second  HIV-2 
isolate  (HIV-2/ROD). 

Identification  of  the  12-14kD  protein  as  the  virally  encoded  X  gene  product  was  further 
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established  by  competitive  adsorption  assays  {Figure  27C).  SIVMAC  and  HIV-2  X  immune  sera 
were  adsorbed  with  nitrocellulose-immobilized  lysates  of  E.  coli  transformed  either  with  the 
pATH  cloning  vector  or  with  the  pATH-X  recombinant  plasmid.  Adsorption  of  the  SIVMAC  X 
immune  sera  with  the  50kD  TrpE/X  fusion  protein  resulted  in  a  complete  loss  of  antibody 
reactivity  with  both  HIV-2  (panel  B’.lane  2)  and  SIVMAC  (panel  B‘,  lane  3)  viral  preparations. 
Preadsorption  of  the  same  sera  with  E.  coli  lysates  transformed  with  the  pATH  vector  alone 
gave  hybridization  signals  equal  to  that  in  Figure  27A,  panel  B  (not  shown).  The  reactivity  of 
HIV-2  X  peptide  antisera  (D1)  was  only  slightly  diminished  by  preadsorption  with  the  SIVMAC 
TrpE/X  fusion  protein,  as  expected,  since  this  antiserum  had  failed  to  recognize  the 
heterologous  X  protein  on  previous  immunoblots  (compare  Figure  27A,  lane  3,  panel  D). 
Taken  together,  these  results  confirmed  the  12-14kD  proteins  as  the  HIV-2  and  SIVMAC  virally 
encoded  X  proteins. 

To  evaluate  whether  the  SIVMAC  X  protein  synthesized  in  E.  coli  could  be  of  diagnostic 
value,  we  examined  a  limited  number  of  patient  and  normal  control  sera  for  reactivity  with  this 
protein  (Figure  28).  Four  sera  from  healthy  West  African  HIV-2  infected  individuals  (a-d),  four 
sera  from  AIDS  patients  seropositive  for  HIV-1  (e-h),  and  two  healthy  control  sera  (i-j)  were 
tested  side-by-side  for  reactivity  to  the  50kD  TrpE/X  protein  (lanes  2),  the  15kD  TrpE  deleted 
X  protein  (lanes  3),  and  lysates  of  bacteria  transformed  with  the  non-recombinant  pATH  vector 
(lanes  1).  Out  of  four  HIV-2  antibody  positive  sera  tested,  two  contained  antibodies  reactive 
with  both  the  50kD  TrpE/X  and  the  15kD  delta  TrpE/X  protein  (panels  a  and  c).  These  sera 
titered  to  a  dilution  of  1:50.  Four  HIV-1  positive  sera  and  two  HIV  negative  control  sera  did 
not  react  with  these  X  specific  fusion  proteins  (panels  e-j).  Bands  corresponding  to  bacterial 
proteins  other  than  the  X  fusion  proteins  were  also  observed,  particularly  with  the  West  African 
sera.  However,  these  were  easily  distinguishable  from  X-specific  reactivities  by  their  size  and 
occurrence  in  the  control  antigen  preparations  (lanes  1). 

In  summary,  the  data  demonstrate  that  HIV-2  and  SIVMAC  viruses  encode  a  novel 
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retroviral  protein,  termed  X,  which  is  expressed  in  virus  cultures  in  vitro  and  in  naturally- 
infected  humans,  in  vivo.  The  molecular  size  of  this  protein,  12-14kD,  approximates  that 
calculated  from  the  deduced  amino  acid  sequence,  although  the  actual  structure  and 
organization  of  the  X  gene  and  its  protein,  and  the  mode  of  X  gene  transcription  in  HIV-2  and 
SIVMAC  viruses,  cannot  be  predicted.  Similarly  unknown  at  the  present  time  is  the  function  of 
the  X  protein  in  the  life  cycle  of  HIV-2  and  S1VMAC.  The  high  degree  of  sequence  conservation 
suggests  a  significant  function  for  X  in  both  viruses.  The  fact  that  HIV-1  lacks  the  X  gene 
entirely,  yet  causes  disease  in  infected  individuals,  implies  that  X  gene  expression  is  not 
required  for  induction  of  cytopathicity  in  vitro  or  in  vivo.  A  search  for  X-related  protein 
sequences  in  the  NBRF  protein  data  bank  identified  several  proline-rich  regions  present  in 
retroviral  proteins,  adenovirus,  papillomavirus,  human  collagen,  and  a  group  of  human 
phosphoproteins  to  share  homology  with  the  carboxyterminus  of  the  X  gene.  In  addition,  a 
probable  nuclear  antigen  of  Epstein-Barr  virus  (9)  showed  10  of  14  conserved  amino  acids  with 
a  region  of  X  just  5'  of  the  polyproline  tract  {GRGhrGprGGwgRPGpaP).  The  significance  of 
these  homologies  is  under  study. 

The  presence  of  antibodies  specific  for  X  in  some  HIV-2  infected  individuals 
demonstrates  both  its  immunogenicity  and  in  vivo  expression.  It  will  be  important  to  determine 
if  X  specific  reagents  will  be  generally  useful  in  distinguishing  between  HIV-2  and  HIV-1 
infection,  whether  expression  of  X  in  vivo  influences  viral  pathogenesis  and  clinical  outcome, 
and  whether  titers  of  X  specific  antibodies  will  provide  prognostically  important  clinical 
information.  The  availability  of  X  specific  reagents  that  we  have  developed  as  part  of  the 
current  DoD  contract  should  facilitate  experiments  addressing  these  questions. 

5.  To  determine  the  molecular  basis  for  attenuation  of  cytopathicity  in  certain 

strains  of  HiV-2. 

HIV-1  and  HIV-2  represent  two  distinct  groups  of  human  immunodeficiency  viruses 
known  to  cause  AIDS  in  infected  individuals  (12,26,27,57,125).  While  HIV-1  is  the  causative 
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agent  of  epidemic  AIDS  worldwide,  HIV-2  has  generally  been  restricted  to  West  Africa. 
Numerous  isolates  of  HIV-1  and  HIV-2  have  been  obtained  and  their  biological  and  molecular 
properties  characterized  (2,12,27,63,215,167).  Nucleotide  sequence  analysis  shows  that  HIV-2 
is  only  distantly  related  to  HIV-1  (63),  while  it  is  more  closely  related  to  two  primate 
retroviruses,  SIVMAC  and  SIVSM,  which  cause  an  AIDS-like  disease  in  captive  macaques 
(22,35,77).  Although  genetically  divergent,  prototype  HIV-1  and  HIV-2  viruses  share  similar 
overall  genomic  organization  and  have  similar  biological  properties,  which  include  their 
propensity  for  rapid  genetic  change  (52,135,167),  their  host  cell  tropism,  their  cytopathic  effect 
on  T-cell  cultures  and  peripheral  blood  mononuclear  cells  in  vitro,  and  their  ability  to  form 
syncytia  with  CD4-bearing  target  cells  (27,125).  In  fact,  the  majority  of  HIV-1  and  HIV-2 
strains  isolated  from  patients  with  immunodeficiency  disease  have  been  shown  to  cause  cell 
fusion  and  the  formation  of  multinucleated  giant  cells  in  culture.  This  represents  a  hallmark 
of  productive  viral  infection  in  tissue  culture  and  accounts  for  the  profound  in  vitro  cytopathic 
effects  of  HIV  (73,103,165).  In  contrast  to  HIV-1  and  HIV-2  isolated  from  clinically  ill 
individuals,  natural  infection  of  African  green  monkeys  by  SIV  occurs  with  high  prevalence  in 
East  and  West  Africa  but  does  not  lead  to  immunodeficiency.  In  addition,  there  is  growing 
evidence  that  the  biologic  properties  of  HIV-1  in  vivo  may  differ  significantly  from  tissue  culture 
adapted  strains.  Thus,  in  order  to  characterize  HI  Vs  and  SI  Vs  as  they  exist  in  vivo,  our  goals 
were:  (i)  to  characterize  by  genetic  and  biologic  analysis  an  HIV-2  strain  shown  to  have 
attenuated  virulence  in  vitro  and  in  vivo ;  (ii)  to  develop  PCR  approaches  that  would  allow,  for 
the  first  time,  the  direct  identification,  cloning,  and  sequence  analysis  of  SIV  and  HIV  from 
uncultured  blood  of  West  African  green  monkeys  and  humans  (all  previous  clones  of  SIVAGM 
have  been  derived  from  East  African  green  monkeys);  (iii)  to  obtain  by  direct  lambda  phage 
cloning  full-length  transfection  competent  clones  of  HIV-1  from  uncultured  human  brain  tissue. 

In  contrast  to  the  prototype  pathogenic  HIV-1  and  HIV-2  viruses,  we  and  others  have 
recently  isolated  less  cytopathic  strains  of  HIV-1  and  HIV-2  which  exhibit  markedly  different 
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biological  properties.  These  particular  isolates  cause  little  or  no  cell  death  in  susceptible  target 
cells,  fail  to  induce  cell  fusion  with  CD4-bearing  immortalized  T-ceil  lines,  exhibit  a  restricted 
host  cell  tropism  with  a  preference  for  PBMC  and/or  macrophages,  and  are  often  derived  from 
asymptomatic  individuals.  While  their  in  vitro  biological  differences  were  well-documented,  the 
genetic  changes  responsible  for  their  attenuated  phenotype  were  not  understood.  In  order  to 
elucidate  determinants  of  HIV  pathogenicity,  we  have  thus  begun  to  molecularly  dissect  a  non- 
fusogenic  and  non-cytopathic  HIV-2  isolate,  termed  HIV-2/ST,  which  was  obtained  from  a 
healthy  Senegalese  prostitute  as  part  of  this  contract.  Although  this  virus  replicated  to  high 
titers  in  tissue  culture,  it  infected  cells  at  a  slower  rate  compared  to  cytopathic  strains  of  HIV-1 
and  HIV-2,  and  caused  little  or  no  cell  killing  and  fusion.  This  was  the  case  despite  the  fact 
that  its  external  envelope  glycoprotein  was  cleaved  correctly,  transported  to  the  cell  surface, 
and  shown  to  bind  to  a  specific  epitope  on  CD4,  which  was  recognized  by  OKT4a  but  not 
OKT4  antibodies.  HIV-2/ST  therefore  appeared  to  bind  to  the  CD4  molecule  analogous  to 
other  HIVs,  but  it  failed  to  fuse  with  CD4-bearing  target  cells  suggesting  that  its  infectivity  was 
greatly  retarded  at  the  level  of  cell  entry  (see  section  4  above). 

Since  HIV  ''isolates"  generally  represent  complex  mixtures  of  genotypically-distinct 
viruses,  and  since  the  biological  phenotype  of  any  HIV  culture  depends  on  the  sum  of  the 
properties  of  each  genotypic  variant  (61,135),  we  first  attempted  to  isolate  a  molecular  clone, 
which  was  both  transfection-competent  and  representative  of  the  in  vitro  properties  of  its 
parental  virus.  We  therefore  obtained  three  full-length  proviral  clones  (XJSP4-27,  AJSP4-32, 
and  XJSP4-34)  from  a  genomic  library  of  a  biologically-cloned  high  producer  cell  line,  termed 
ST/BI 2,  and  subsequently  transfected  them  into  the  neoplastic  T-cell  lines  SupTI  (147)  and 
CEMx174  (82,138).  Reverse  transcriptase  activity  was  detected  in  supernatants  of  cultures 
transfected  with  XJSP4-27  as  early  as  five  days  post-transfection,  while  ?JSP4-32  and 
XJSP4-34  transfected  cultures  revealed  no  signs  of  viral  replication,  indicating  that  these 
proviruses  were  replication-defective.  Immunofluorescence  analysis  further  confirmed  the 
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presence  of  virus  expressing  cells  in  XJSP4-27  transfected  cultures,  but  failed  to  identify  virus 
mediated  cell  fusion.  Western  blot  analysis  of  purified  JSP4-27  virions  demonstrated  a  protein 
profile  similar  to  that  of  the  parental  HIV-2/ST  virus.  To  facilitate  subsequent  transfection 
experiments  and  to  allow  the  direct  comparison  of  JSP4-27  to  other  transfection-competent 
HIV-2  plasmid  constructs,  we  sub-cloned  the  proviral  insert  of  XJSP4-27  into  the  plasmid  vector 
pSP65  (Figure  29A). 

To  test  whether  the  transfection-derived  JSP4-27  virions  were  infectious,  filtered 
supernatants  of  plasmid-transfected  cultures  were  transmitted  to  uninfected  SupTI  and 
CEMx174  cells.  The  results  showed  that  cell-free  transmission  of  JSP4-27  virions  was  readily 
and  reproducibly  demonstrable.  However,  infection  and  spread  in  culture,  particularly  in  SupTI 
cells,  occurred  slowly  and  with  considerable  delay.  These  results  were  confirmed  and 
extended  in  comparative  studies  with  a  transfection-derived,  cytopathic  HIV-2/ROD  strain, 
termed  SL1  (101).  While  transfection  of  the  SL1  provirus  resulted  in  >90%  infected  SupTI 
or  CEMx174  cultures  within  three  to  four  days  post-transfection,  JSP4-27  transfected  cultures 
reached  only  10%  infectivity  in  the  same  time  period,  which  indicated  the  same  reduced  ability 
to  spread  in  culture  that  had  been  observed  for  the  parental  HIV-2/ST  virus  (93).  Similarly, 
the  transfection-derived  JSP4-27  cultures  did  not  form  syncytia  upon  cocultivation  with  several 
CD4-bearing  T-cell  lines,  including  SupTI,  CEM,  H9  and  CEMx174  cells,  whereas  SL1  infected 
cultures  produced  numerous  and  large  syncytia,  as  well  as  a  profound  cytopathic  effect,  with 
these  same  target  cells  (Figure  29B).  These  data  thus  showed  that  the  JSP4-27  provirus  was 
replication-competent  and  infectious,  and  exhibited  the  same  non-fusogenic  and  non-cytopathic 
properties  as  previously  described  for  the  parental  virus  (93). 

Having  identified  and  characterized  the  biological  features  of  a  molecular  clone  of 
HIV-2/ST,  we  next  sequenced  its  entire  genome.  The  complete  nucleotide  sequence  of  the 
JSP4-27  provirus  is  depicted  in  Figure  30.  The  viral  genome  is  9,672  bp  in  length  and 
exhibits  an  overall  genomic  organization  of  5’LTR-gag-pol-central  region-env-nef-3'LTR,  which 
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is  identical  to  that  of  other  cytopathic  HIV-2  and  SIVMAC  proviruses.  It  contains  all  major  open 
reading  frames  characteristic  for  HIV-2,  including  vpx  which  is  present  in  HIV-2  and  SIVMAC 
viruses  but  not  in  HIV-1,  and  vpr  which  is  present  in  HIV-1,  HIV-2  and  SIVMAC  but  not  in  SIV^ 
viruses  (55).  Like  other  HIV/SIV  proviruses,  HIV-2/ST  is  flanked  by  LTR  sequences  which  are 
known  to  regulate  viral  gene  expression.  Sequence  comparison  with  other  HIV-2  LTRs 
showed  that  regulatory  elements,  like  the  tata  box,  the  polyadenylation  site,  core  enhancer 
sequences,  Spl  binding  sites,  and  the  tat  responsive  region  are  all  present  in  HIV-2/ST  and 
that  their  sequences  are  highly  conserved.  The  HIV-2/ST  LTR  is  of  similar  length,  and  there 
are  no  major  deletions  or  insertions  which  would  distinguish  it  from  the  LTRs  of  other 
cytopathic  HIV-2  viruses  (data  not  shown). 

Comparison  of  the  deduced  amino  acid  sequence  of  the  HIV-2/ST  reading  frames 
suggested  that  they  all  encoded  full-length  and  functional  proteins,  with  the  exception  of  the 
vpr  gene.  This  open  reading  frame  was  found  to  contain  an  in-frame  TAA  stop  codon  which 
truncates  the  vpr  protein  prematurely  after  the  first  32  amino  acid  residues.  Since  the  JSP4-27 
provirus  is  fully  replication-competent,  it  can  be  concluded  that  the  vpr  gene  product  is  not 
required  for  in  vitro  replication  of  HIV-2.  This  conclusion  was  confirmed  by  the  biological 
analysis  of  a  second  vpr-deficient  HIV-2  provirus  independently  constructed  in  our  laboratory, 
as  well  as  by  the  findings  of  others  (37).  Moreover,  since  vpr-deficient  proviruses  of  HIV-2 
are  also  cytopathic  and  fusogenic,  it  is  unlikely  that  the  lack  of  a  functional  vpr  gene  in 
HIV-2/ST  is  responsible  for  its  attenuated  phenotype. 

Pairwise  sequence  alignments  of  JSP4-27  to  other  cytopathic  strains  of  HIV-2  similarly 
revealed  no  genetic  features  unique  for  HIV-2/ST.  Comparison  of  HIV-2/ST  and  HIV-2/ROD 
demonstrated  an  overall  sequence  divergence  of  11%,  which  is  within  the  expected  range  of 
genetic  variability  observed  among  geographically-distant  isolates  of  HIV-2  (Figures  31,32). 
Three  other  recently  reported  HIV-2  viruses,  HIV-2/ISY  derived  from  a  Gambian  individual  with 
AIDS  (2,52),  HIV-2/NIHz  derived  from  an  AIDS  patient  from  Guinea  Bissau  (167),  and 
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HIV-2/GH  derived  from  an  AIDS  patient  from  Ghana  (72,83)  differ  from  HIV-2/ROD  (Cape 
Verde  Islands,  refs.  27  and  63)  by  11%,  12%,  and  12%,  respectively.  Among  all  these 
viruses,  the  Senegalese  HIV-2/ST  virus  was  found  to  be  most  closely  related  to  the  Gambian 
isolate  HIV-2/ISY,  which  shared  90%  of  its  nucleotide  sequence  with  HIV-2/ST. 

Since  infectivity,  syncytia  formation,  and  ceil  fusion  are  viral  properties  which  are 
mediated  by  the  viral  env  gene,  we  examined  this  gene  in  particular  with  respect  to  sequence 
differences  unique  for  HIV-2/ST.  An  alignment  of  the  deduced  HIV-2/ST  env  sequence  to 
those  of  six  other  cytopathic  and  fusogenic  HIV-2  and  SIV  viruses  is  depicted  in  Figure  33. 
Overall,  the  size  of  the  various  env  sequences  compared  is  approximately  the  same.  In 
contrast  to  other  HIV-2  and  SIVMAC  viruses,  JSP4-27  contains  no  in-frame  stop  codon  in  its 
transmembrane  envelope  domain  (Figure  33).  This  is  consistent  with  the  presence  of  a  43kD 
rather  than  a  32kD  transmembrane  glycoprotein  on  Western  blots  of  JSP4-27  derived  virions 
(Figure  29B),  and  is  distinct  from  the  protein  profile  of  the  HIV-2/ST  parental  strain  which 
comprises  a  mixture  of  viruses  with  both  full-length  as  well  as  truncated  transmembrane 
proteins  (93).  Pairwise  sequence  alignment  shows  that  the  HIV-2/ST  env  sequence  differs 
from  other  HIV-2  and  SIV  envelopes  to  the  same  degree  as  they  differ  from  each  other,  with 
amino  acid  sequences  varying  between  16%  and  30%  (Figure  31).  25  of  32  cysteine 

residues,  including  22  of  23  located  in  the  extracellular  domain,  are  conserved  among  all 
viruses,  which  indicate  a  highly  conserved  envelope  structure.  In  addition,  HIV-2/ST  contains 
28  potential  N-linked  glycosylation  sites  which  are  arranged  in  a  pattern  similar  to  that  of 
other  viruses,  and  which  also  include  one  highly  conserved  glycosylation  site  previously  shown 
to  be  critical  for  HIV-1  infectivity  (162).  Finally,  the  HIV-2/ST  env  gene  contains  highly  variable 
regions  which  correspond  closely  in  distribution  and  size  to  similar  hypervariable  regions  in  the 
other  env  gene  sequences. 

Although  a  three-dimensional  structure  has  not  been  determined  for  any  HIV  or  SIV 
envelope  glycoproteins,  there  are  certain  envelope  domains  whose  functions  have  been 
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characterized  by  mutagenesis  analysis.  These  include  the  putative  CD4  binding  domain 
(95,100),  the  envelope  precursor  cleavage  site  (110),  and  the  viral  fusion  sequence 
(16,18,56,95).  Since  sequence  changes  in  any  one  of  these  domains  could  alter  the  fusogenic 
properties  of  a  virus  (95),  we  analyzed  the  envelope  sequence  of  HIV-2/ST  for  particular 
mutations  in  these  areas.  No  changes,  or  only  conservative  amino  acid  changes  were  found 
in  an  envelope  domain  of  JSP4-27  that  corresponds  to  the  HIV-1  envelope  region  previously 
identified  to  be  involved  in  CD4  binding  (100).  HIV-2/ST  also  contained  an  apparently 
functional  primary  envelope  precursor  cleavage  site,  with  a  recognition  sequence  (RNKR) 
identical  to  that  of  three  other  fusogenic  HIV-2  or  SIV  viruses  (compare  Figure  33).  In  contrast 
to  these  viruses,  however,  HIV-2/ST  was  found  to  differ  in  2  of  16  highly  conserved  amino 
acid  residues  at  the  N-terminus  of  the  transmembrane  envelope  glycoprotein  which,  as  shown 
by  site-directed  mutagenesis,  contains  the  viral  fusion  domain  (16).  The  mutations  include  an 
alanine  to  threonine  change  involving  amino  acid  residue  517  (position  12  after  the  cleavage 
site),  and  a  serine  to  alanine  change  involving  amino  acid  residue  521  (position  16  after  the 
cleavage  site).  Only  one  other  fusogenic  HIV-2  virus,  HIV-2/ISY,  contained  these  same 
changes.  However,  this  virus  exhibited  three  additional  mutations  in  this  same  envelope  area 
(compare  Figure  33).  Since  the  fusion  domain  is  generally  highly  conserved  among  cytopathic 
HIV  and  SIV  viruses  (16),  we  considered  the  possibility  that  the  non-fusogenic  properties  of 
HIV-2/ST  resulted  from  these  mutations. 

To  determine  if  the  observed  amino  acid  substitutions  in  the  HIV-2/ST  envelope  fusion 
region  were  likely  responsible  for  the  impaired  cytopathic  properties  of  this  virus,  we  examined 
two  fusogenic  variants  of  HIV-2/ST,  termed  ST/24.1  C  and  ST/24 ,2C  (Figure  34).  Both 
fusogenic  strains  were  originally  derived  from  a  biologically-cloned  sub-culture  of  HIV-2/ST, 
termed  ST/24,  which  produced  non-cytopathic  and  non-fusogenic  virions  biologically 
indistinguishable  from  the  parental  HIV-2/ST  isolate.  Following  serial  cell-free  transmissions 
of  ST/24  supernatant  to  uninfected  SupTI  cells,  large  and  numerous  syncytia  were  observed 
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on  two  independent  occasions,  which  indicated  the  emergence  of  fusogenic  progeny  virus  in 
the  culture.  Two  cell  lines  were  subsequently  established  (ST/24.1  C  and  ST/24.2C)  and  shown 
to  produce  virions  with  fusogenic  and  cytopathic  properties  similar  to  prototype  HIV-1  and 
HIV-2  viruses.  Moreover,  these  cell  lines  were  confirmed  to  be  infected  with  HIV-2/ST  by 
Southern  blot  analysis,  which  revealed  no  changes  in  their  BamHI,  Nhel,  Hindlll,  and  Pstl 
cleavage  patterns  as  compared  to  ST/24.  In  order  to  identify  the  molecular  basis  for  the 
phenotypical  change  in  these  variants  and  to  determine  whether  a  direct  mutation  of  the  viral 
fusion  sequence  had  occurred,  we  amplified  the  envelope  fusion  domain  of  these  cultures 
using  the  polymerase  chain  reaction  (136). 

Two  oligonucleotide  primers  (30  mers)  were  designed  to  allow  the  amplification  of  a  544 
bp  envelope  fragment  from  virally-infected  cellular  DNA,  which  included  the  putative  precursor 
cleavage  site  as  well  as  the  envelope  fusion  region  (Figures  35  and  36).  Both  primers  were 
synthesized  according  to  the  JSP4-27  sequence,  however,  sequence  changes  were  introduced 
so  as  to  accommodate  a  BamHI  site  in  the  5‘  amplimer  and  a  Pst  I  site  in  the  3’  amplimer 

(primer  V.  S' AGAATTGGGGGAT CCTAAATT GATAGAAGT  3’;  and  primer  2:  5'GCTATTTAATTTCTGCAGTTCATACATGTT  3'). 

Total  genomic  DNA  of  ST/24,  ST/24.1C,  ST/24.2C,  as  well  as  ST/B12  as  a  control,  was 
amplified  using  these  primers.  100  p.1  of  reaction  mixture  contained  10  mM  Tris-HCI,  pH  8.3, 
50  mM  KCI,  1.5  mM  MgCI2,  0.01%  gelatine,  200  mM  of  each  deoxynucleotide  triphosphate 
(dNTP),  10  pM  of  each  primer,  2.5  U  of  Taq  polymerase  and  1  }.ig  of  high  molecular  weight 
DNA.  Samples  were  subjected  to  45  amplification  cycles,  consisting  of  a  denaturing  step  at 
94°C  for  90  seconds,  a  primer-annealing  step  at  50°C  for  90  seconds,  and  a  primer  extension 
step  at  72°C  for  135  seconds.  Amplified  envelope  fragments  were  purified,  cleaved  with 
BamHI  and  Pstl,  and  subsequently  cloned  into  Ml 3.  Ten  individual  M13  clones  per  amplified 
DNA  preparation  were  then  isolated,  and  each  clone  was  sequenced  in  the  region,  which 
corresponded  to  the  N-terminus  of  the  transmembrane  envelope  domain.  An  alignment  of 
these  sequences  is  depicted  in  Figure  35. 
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All  10  M13  clones  derived  from  ST/BI 2  contained  sequences  identical  to  JSP4-27, 
which  indicated  that  the  PCR  amplification  procedure  was  reliable  and  did  not  cause  frequent 
misincorporations  of  nucleotides  in  this  particular  DNA  template.  Sequence  comparison  of 
amplified  fragments  from  cell  line  ST/24  demonstrated  no  differences  among  the  individual  Ml 3 
clones,  but  revealed  4  nucleotide  point  mutations  between  these  ST/24  sequences  and  the 
JSP4-27  reference  sequence.  In  fact,  all  ST/24  derived  viruses,  including  the  cytopathic  and 
fusogenic  ones,  exhibited  these  same  four  nucleotide  sequence  differences,  as  well  as  the 
threonine  and  alanine  substitutions  previously  identified  in  the  envelope  fusion  region  of 
JSP4-27.  The  results  thus  confirmed  that  the  observed  fusion  sequence  mutations  were 
representative  of  all  HIV-2/ST  strains,  regardless  of  their  origin  and  biological  phenotype. 
Interestingly,  eight  of  nine  Ml 3  clones  representing  ST/24.1  C  and  five  of  ten  clones 
representing  ST/24.2C  contained  additional  point  mutations,  which  predicted  three  amino  acid 
sequence  changes  with  respect  to  the  ST/24  sequence  (Figure  36).  The  presence  of  these 
mutations  within  the  amplified  material  identifies  the  ST/24.1  C  and  ST/24.2C  viral  strains  as 
mixtures  which  comprise  the  parental  ST/24  virus  as  well  as  additional  genotypic  variants. 
These  newly-generated,  genotypical  variants  must  be  responsible  for  the  phenotypical  changes 
seen  in  the  ST/24.1  C  and  ST/24.2C  cultures.  However,  the  genetic  changes  which  are  of 
biologically  significance  do  not  appear  to  occur  in  the  envelope  fusion  domain. 

While  these  studies  were  in  progress,  the  biological  properties  of  the  HIV-2/ST  envelope 
gene  products  were  also  analyzed  in  a  eukaryotic  expression  system  (115).  Vaccinia  virus 
expressed  JSP4-27  envelope  glycoproteins  were  compared  to  that  of  prototype  HIV-1  and 
HIV-2  viruses.  While  the  processing,  expression,  and  transport  to  the  cell  surface  appeared 
to  be  unaltered,  vaccinia  virus-produced  JSP4-27  envelope  glycoproteins  failed  to  form  syncytia 
with  CD4-bearing  Hela  cells.  Since  no  other  HIV-2  proteins  were  produced  in  this  system, 
these  results  strongly  suggested  that  the  JSP4-27  envelope  glycoproteins  were  primarily 
responsible  for  its  non-fusogenic  and  non-cytopathic  phenotype  (115).  Based  on  these  results 
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as  well  as  the  sequence  data,  we  conclude  that  the  non-fusogenic  and  attenuated  properties 
of  HIV-2/ST  are  indeed  a  function  of  its  envelope  gene  products,  although,  the  causative 
genetic  defect  appears  not  to  involve  mutations  within  the  envelope  fusion  domain. 

There  are  several  mechanisms  other  than  a  direct  alteration  of  the  fusion  sequence  that 
could  result  in  the  biological  changes  observed  in  HIV-2/ST.  These  include  mutations  that 
affect  envelope/CD4  interactions,  mutations  that  cause  differences  in  envelope  glycosylation, 
mutations  that  require  additional  cell  surface  molecules  to  facilitate  virus-cell  fusion,  as  well  as 
mutations  that  reduce  the  stability  of  envelope  glycoprotein  complexes  on  the  cell  surface.  In 
fact,  several  naturally-occurring  as  well  as  genetically-engineered  immunodeficiency  viruses 
have  been  described,  that  are  altered  in  their  fusogenic  or  cytopathic  properties  because  of 
one  of  these  mechanisms.  For  example,  possible  differences  in  the  binding  affinity  of  HIV  and 
SIV  envelope  glycoproteins  to  the  CD4  receptor  have  been  suggested  by  the  finding  that  25- 
fold  more  soluble  CD4  is  necessary  to  block  infectivity  of  prototype  HIV-2  viruses  compared 
to  HIV-1  (25).  It  is  possible  that  the  HIV-2/ST  envelope  glycoprotein  binds  the  CD4  molecule 
with  an  even  lower  affinity,  which  would  be  expected  to  influence  subsequent  steps  of  viral 
entry,  including  membrane  fusion  and  penetration.  Another  mechanism  known  to  cause 
attenuation  of  virulence  in  naturally-occurring  retroviruses  involves  differences  in  post- 
translational  modifications  of  envelope  glycoproteins.  Mullins  and  coworkers  showed  that  the 
pathogenic  determinants  of  an  immunodeficiency-causing  FeLV-FAIDS  virus  were  dependent 
on  the  processing  of  particular  envelope  oligosaccharides  (126).  Since  HIV-2/ST  differs  in 
number  and  distribution  of  its  potential  N-linked  envelope  glycosylation  sites  from  other 
cytopathic  HIV-2’s,  and  since  size  differences  between  the  exterior  envelope  glycoproteins  of 
fusogenic  and  non-fusogenic  HIV-2/ST  strains  have  been  observed,  a  biologically-significant 
change  in  the  sugar  composition  of  the  HIV-2/ST  envelope  cannot  be  excluded.  Finally,  a 
requirement  of  accessory  molecules  for  virus-cell  fusion  represents  still  another  potential 
mechanism  to  influence  retroviral  cytopathicity.  Studies  involving  SIVMAC  recently  revealed  that 
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this  virus  has  a  restricted  host-cell  range  which  comprises  only  a  subset  oi  CD4+  T-cell  lines 
(82,92).  While  highly  infectious  and  cytopathic  for  HUT78  and  H9  cells,  SIVMAC  does  not  fuse 
with  CD4-bearing  SupTI  cells.  Moreover,  SIVMAC  infects  SupTI  cells  only  with  considerable 
delay.  It  is  thus  conceivable,  that  SIVMAC  requires  a  surface  molecule(s)  in  addition  to  CD4 
to  establish  a  productive  infection  in  certain  human  T-cell  lines.  Since  its  infection  kinetics  and 
lack  of  cytopathic  effect  in  SupTI  cells  very  much  resemble  those  of  HIV-2/ST,  it  is  not 
unreasonable  to  speculate  that  HIV-2/ST  might  similarly  require  an  additional  cell  surface 
molecule(s)  for  efficient  cell  fusion  or  penetration. 

The  availability  of  cytopathic  variants  of  HIV-2/ST  will  be  instrumental  for  future 
experiments  designed  to  define  the  exact  molecular  determinants  involved  in  HIV-2/ST 
attenuation.  Molecular  clones  representing  the  fusogenic  and  cytopathic  HIV-2/ST  strains  are 
expected  to  exhibit  much  less  genetic  divergence  with  respect  to  JSP4-27  than  do  unrelated 
HIV-2  viruses,  like  HIV-2/ROD  or  H1V-2/ISY.  Therefore,  a  comparative  sequence  analysis  is 
more  likely  to  identify  biologically-important  differences,  and  the  construction  of  chimeras 
between  attenuated  and  cytopathic  clones  will  be  greatly  facilitated.  The  fact  that  cytopathic 
and  fusogenic  ST/24  mutants  evolved  by  cell  free  passage  on  two  independent  occasions 
indicates  the  presence  of  strong  selective  pressures  for  cytopathic  and  fusogenic  viruses  in 
vitro.  It  is  possible  that  similar  pressures  are  also  present  in  vivo  which  may  favor  the 
emergence  of  more  virulent  strains  in  certain  HIV  infected  individuals  over  time. 

6.  To  develop  PCR  techniques  for  sensitive  and  specific  amplification  of  novel 
HIV/SIV  genomes  using  universal  primers  and  "nested"  amplification  and  to  apply 
this  to  the  detection  and  characterization  of  SiV  in  uncultured  blood  of  West 
African  green  monkeys  and  people. 

Wild  caught  African  green  monkeys  (AGM)  from  East  and  West  Africa  are  infected  with 
SIV  viruses  at  a  high  prevalence  rate  (30-60%)  in  contrast  to  Asian  macaques  which  are  not 
naturally  infected  (35,85-88).  Because  of  this,  it  has  been  speculated  that  AGMs  could  harbor 
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viruses  that  served  as  the  progenitor  of  human  immunodeficiency  viruses  acquired  through 
cross-species  transmission.  Recent  sequence  analyses  of  a  sooty  mangabey  virus  has  shown 
it  to  be  most  closely  related  to  HIV-2  (77).  To  date,  isolation  and/or  molecular  characterization 
of  SIVAGM  from  West  Africa  has  not  been  reported.  Of  additional  interest  is  the  fact  that  wild 
caught  AGMs,  unlike  experimentally  infected  rhesus  macaques,  do  not  appear  to  develop 
disease  as  a  result  of  SIV  infection.  Because  a  molecular  analysis  of  SIVAGM  from  West  Africa 
could  provide  important  information  relevant  to  the  evolutionary  relationships  of  all  HIV  and  SIV 
viruses,  and  because  the  biologic  properties  of  these  viruses  are  likely  to  be  enlightening,  we 
developed  a  PCR-based  approach  for  cloning  these  viruses  directly  from  uncultured  AGM 
peripheral  blood  lymphocytes.  Figures  37  and  38  illustrate  the  "nested"  PCR  technique  that 
we  developed  which  accomplishes  this.  Using  two  sequential  30  cycle  rounds  of  amplification, 
first  with  an  outer  primer  set  and  then  by  an  inner  set  containing  cloning  sites  for  the  Ml 3 
polylinker,  we  successfully  cloned  SIVAGM  from  four  out  of  four  seropositive  AGMs  and  from 
zero  of  two  seronegative  animals.  Figures  39,  40  and  41  show  the  nucleotide  sequences  of 
the  amplified  regions  of  the  polymerase  gene  and  their  relationship  to  SI  Vs  from  other  simian 
species  and  subspecies.  These  results  are  important  for  the  following  reasons:  (i)  they 
demonstrate  for  the  first  time  the  utility  of  the  generic  (universal)  "nested"  PCR  approach  for 
detecting  and  cloning  novel  groups  of  viruses  whose  sequence  divergence  from  known  viruses 
precludes  conventional  hybridization  and  cloning  approaches;  (ii)  they  identify  the  SIV  strain 
infecting  feral  West  African  AGMs  (sabeus)  and  show  that  this  virus  group  is  not  closely 
related  to  HIV-1  or  HIV-2;  (iii)  they  provide  a  rapid  method  for  obtaining  molecular  probes  of 
novel  viruses  for  conventional  lambda  phage  cloning  and  other  kinds  of  nucleic  acid  analyses. 
7.  To  molecularly  clone  full-length  replication  competent  HIV-1  provlruses  from 
uncultured  human  brain  specimens  for  analysis  of  virus  biology,  molecular 
determinants  of  tropism  and  virulence,  genomic  organization,  and  gene  structure- 
function  relationships. 
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All  previous  full-length  replication-competent  proviral  clones  of  HIV-1  have  been  obtained 
from  cell  cultures  of  amplified  virus,  due  to  the  low  abundance  of  viral  DNA  in  chronically 
infected  individuals  (145).  Because  such  virus  strains  may  be  altered  as  a  result  of  in  vitro 
propagation  (162),  we  sought  to  moleculariy  clone  and  genetically  and  biologically  characterize 
full-length  replication  competent  HIV-1  proviruses  directly  from  uncultured  human  brain  tissue 
of  a  patient  with  AIDS  Dementia  Complex  (ADC).  Objectives  were  (i)  to  generate  replication 
competent  proviral  clones  from  uncultured  human  tissue  thereby  allowing  an  analysis  of 
genome  organization  and  gene  structure-function  relationships  of  virus  not  subjected  to  in  vitro 
selection  pressures,  (ii)  to  generate  transfection-derived  genetically  defined  HIV-1  strains 
without  interim  cell  culture  for  analysis  of  virus  biological  properties,  replicative  DNA 
intermediate  forms,  integration  status,  and  the  possible  existence  of  defective  and/or  helper 
forms,  (iii)  to  compare  and  characterize  genotypic  variation  of  proviral  clones  obtained  by 
direct  lambda  phage  cloning  as  compared  to  clones  obtained  by  PCR  amplification.  For  these 
studies,  high  molecular  weight  DNA  from  a  brain  specimen  obtained  at  necropsy  was  subjected 
to  lambda  phage  cloning  and  10  HIV-1  proviral  clones  out  of  8  x  106  recombinants  were 
obtained  (Figure  42).  These  proviral  clones  contained  integrated  and  unintegrated  forms,  forms 
with  one  or  two  LTRs,  and  genomes  with  large  deletions  and  self-integrated,  reversed  LTR 
sequences.  Eight  HIV-1  proviral  clones  (in  lambda)  and  11  PCR  derived  clones  from  the 
same  brain  were  sequenced  in  a  525  bp  hypervariable  envelope  region.  All  19  clones  were 
highly  related  yet  distinct  with  nucleotide  variation  between  0.1  and  1.1%. 

These  nucleotide  sequence  data  derived  from  lambda  and  PCR  derived  HIV-1  molecular 
clones  from  two  different  frozen  specimens  of  brain  tissue  that  were  processed  one  year  apart 
prove  conclusively  that  the  full-length  HIV-1  proviral  clones  described  herein  are  authentic,  and 
in  fact,  represent  the  first  such  clones  from  uncultured  human  tissue.  Figures  43  and  44 
summarize  the  comparative  nucleotide  sequence  data  in  the  envelope  region  of  the  eight 
lambda  clones  and  eleven  PCR  derived  clones.  5/8  lambda  clones  and  4/1 1  PCR  clones  were 
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identical  over  this  525  base  pair  length.  The  other  3  lambda  clones  differed  by  0.19  to  0.38% 
and  the  other  7  PCR  clones  differed  from  the  predominant  (consensus)  form  by  0.19  to  1.14%. 
Interestingly,  two  deletions  and  two  stop  codons  were  identified  in  the  PCR  clones  but  not  in 
the  lambda  clones.  Figure  44  shows  the  specific  nucleotide  and  amino  acid  changes  that 
were  present  in  non-consensus  clones,  most  of  which  represented  nucleotide  transitions.  The 
biological  significance  of  these  changes  are  under  study. 

Four  proviral  clones  in  lambda  were  determined  to  be  full-length  by  restriction  mapping 
and  two  of  these  were  shown  to  be  transfection-competent  in  Cos-1  cells  and  replication 
competent  in  human  T  cells  and  monocytes  after  cell  free  passage.  We  have  found  in 
preliminary  studies  that  these  molecularly  derived  strains  of  HIV-1  from  uncultured  human  brain 
replicate  to  higher  levels  in  monocyte-macrophages  than  do  other  putative  monocyte  tropic 
viruses  and  therefore  they  represent  important  reagents  for  characterizing  monocyte  tropic  viral 
determinants  at  a  molecular  level.  The  genomic  organization,  structure-function  characteristics 
of  specific  gene  products,  and  the  biological  properties  of  these  HIV-1  proviral  clones  are 
under  study  and  will  be  pursued  aggressively  in  the  future. 
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Figure  1.  Sequence  Comparison  of  Five  Independent  HIV-1  Proviral  Clones 
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Figure  2.  Nucleotide  Sequences  of  the  Entire  Envelope  Genes  of  Five  Independent  HIV-1  Provlral  Clones. 
Sequence  information  for  BH-10  (HTLV-lllb),  ARV-2.  and  LAV-la  was  obtained  from  GENBANK.  Sequences  for 
WMJ-1  and  HAT-3  were  determined  as  part  of  the  present  study.  Alignment  of  the  sequences  was  performed 
pairwise  with  the  assistance  of  NUCALN.  Numbering  of  nucleotides  is  from  the  first  nucleotide  of  BH-10  following 
the  TAG  stop  codon,  and  genomic  regions  corresponding  to  the  signal  peptide,  extracellular  glycoprotein  (gp  120), 
and  transmembrane  glycoprotein  (gp  41)  are  shown.  Dashes  indicate  nucleotide  identify  with  BH-10  and  spaces 
indicate  the  absence  ol  that  nucleotide.  The  boxed  nucleotides  comprise  adjacent  perfect  or  imperfect  direct  repeats. 
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GCAGCAGGAAGCACTATGGGCGCAGCaTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAATTTGCTGAGGGCTATTGAGGCSCAACAGCAT  X7X3 
AXeAlaGlySetThcMecGlyAlaAXaSerMetThrLeuThcVelGlnAleArgGXnLeuLeuSerGly 1 1  eve  LG lnGlnG InAsnAsnLeuLeuArqAla I leGluAleGlnGlnHi s 

- - — - — - — -T- - T - — - — - - — — —  — - - A - - - - - - - - - A - 

Vel  Leu 

- - - - -CG--- - - — - - — . . — - - - - .... _ 

Arg 

- — - - - C - - - - - — - -  —  A - — - 

Leu 

- - - - QC - A - A - - - C - —  A - A - - - — - - 

G  ly  He  His 

CTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGAT7TGGGGTTGCTCTGGAAAACTCATTTGC  1*3  3 
LeuLeuGlnLeuThf/elTrpGXyl  X  eLy  sClnLeuGlnAl  eAr  gl  Xe  Leu  Ale  Vs  IGluArgTy  r  Leu Lys  AspGlnG  XnLeuLeuGly  1 1  eT  rpGlyCysSe  rG  ly  Ly  s  Leu  I  XeCy  s 


Vel 
-G- — 

v«l 


-ill- 

Arg 


ACCACTGCTOTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCC  1963 
ThrThrAlaVelPr  oTcpAsnAl eSe  rT  rpSer AsnLysSe  rLeuGluG In 1 1 eTr pAsnAsnMetThrT  rpMetGluTrpAsoAr gGluX 1 eAsnAtnTy rTh  rSe  rLeuI leHisSer 

Asp  Asp  Gin  Glu  Asp  AsnThr  TyrThr 


The 

~Thr~ 


£Asg  Glu  Asp  TyrAsn 

-A- TAT - - - - - C - A - - - G - 0--A— - T — AA- 

Asnnec  Gin  Glu  Asp  Glylle  TyrAsn 


5t££^?A?ASwT5fA£^G^A7rA7TG9AA7TAGATAAAIGGGGAAGTTTGTGGAArrGGT‘”AACATAACAAATTGGCTGTGGTATATAAAATTA  20  7 

-a-———?  1  u*luL  *uoluL#uA,pL/  pl*s*rL#uTrpA‘nTrpph#A*nI1*ThrAanTrpL*uTrpT*t2I#Ly*L*u 


tie 
-O-A — 
Argtle 


^JCAf  AATCAf  AGtAGGaGGCTTgGTAcGTTTAaGAaTAGTTtTTGCtGtAcTTTC?GTAGTGaaTAGAGTTAGGCAGGGATATTCACCATTATCGTTTCAGACCCACCTCCCAATCCCG  219  3 

l*vsl*h*AleValLeuSecVelVeiAsnArgVelArgGlnGlyTycSerProLeuSerPheGlaThrHlsLeuProHerro 


Vel 
— C — 
Thr 


-c - A— 

Lys 


lie 

lie 


Thr 
-GC — 
Ale 


-G-- - G - C-T - T - 

V*1  His  ?he 

-GGCG-TGC - - — T— — T GA - 

GlyGXyAl e  Phe  Thr 


-T - C- 

TcpThr 


. tn . . . »*»  . ;S; - ‘ . 


- A - 1—— _ c _ _Thr  Ale  Ar  g  Leu  His 


A - A - G-G - T - TT - 

Lys  G  ly  lie 

- - - T— A - A - 

Thr 

AGAAGAATAAGACAGGGCTTGGAAAGGATTTTGCTA  26*9 
ArgAr gl 1 eAr  gGlnG lyLeuGl uArgl l e Leu Leu 


Cys 

- - - - A — ATC-G- - AT  — 

,  ArglleCys  lie 

- A- - C - A--AT-CT - T - TT- 

Ale  ArglleLeu  PheLeu 


■A- 


G 
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Figure  3.  Amino  Acid  Sequences  of  the  Entire  Envelope  Genes  of  Five 
Independent  AIDS  Virus  Isolates.  Sequence  information  for  BH-10  (HTLV-lllb),  ARV- 
2,  and  LAV-1  a  was  obtained  from  GENBANK  and  information  for  WMJ-1  and  HAT-3 
from  the  present  study.  Alignment  of  the  sequences  was  performed  pairwise  with  the 
assistance  of  PRTALN.  Numbering  of  amino  acids  is  from  the  first  amino  acid  of  BH- 
10  and  regions  corresponding  to  the  signal  peptide,  extracellular  envelope  glycoprotein 
(gp  120),  and  transmembrane  glycoprotein  (gp  41)  are  shown.  Dashes  indicate  amino 
acid  identify  with  BH-10  and  spaces  indicate  he  absence  of  that  amino  acid.  Arrows 
denote  cysteine  residues  and  solid  and  open  circles  denote  conserved  and  nonconserved 
sites  of  potential  N-linked  glycosylation,  respectively.  Darkly  shaded  regions  within  the 
extracellular  envelope  glycoprotein  correspond  to  regions  of  hypervariability  as 
determined  by  both  visual  inspection  and  by  computer  analysis  of  variation  (see  Figure 
4).  Lightly  shaded  regions  correspond  to  areas  that  are  relatively  highly  conserved. 
Note  that  additional  regions  of  intermediate  variability  are  interspersed  among  highly 
variable  and  highly  conserved  areas. 
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whose  secondary  structures  are  depicted  in  Figure  5. 


Figure  5 
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Figure  5.  Predicted  Secondary  Structure  and  Hydrophiiicity  Patterns  of  Representative  Variable 
and  Conserved  Envelope  Regions  of  BH-10,  ARV-2,  LAV-1  a,  WMJ-1,  and  HAT-3. 


The  left  hand  panel  corresponds  to  the  first  major  variable  region  of  the  extracellular  viral  envelope, 
amino  acids  137-158  in  Figure  3.  The  right-hand  panel  corresponds  to  the  largest  conserved  region, 
which,  in  addition,  possesses  structural  and  hydrophiiicity  properties  suggestive  of  antigenicity.  This 
region  corresponds  to  amino  acids  473-518  immediately  5’  to  the  envelope  processing  site  shown  in 
Figure  3  .  Ovals  denote  hydrophilic  regions  and  diamonds  hydrophobic  regions.  The  radius  of  an  oval 
or  a  diamond  over  a  residue  is  proportional  to  the  mean  hydrophiiicity  or  hydrophobicity  as  calculated 
for  that  residue  plus  the  next  four  residues.  Positions  of  p  turns  are  indicated  by  turns  of  the  chain 
(  cEL  )-  a  helices  as  bold  coils  (<W^  ),  P  sheets  as  narrow  zigzags  ( A/V^  ),  and  random  coils  as 
wide  zigzags  (  /N/\  ). 
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WMJ1 

WU2 

WKJ3 

WHJ1 

WMJ2 

WMJ3 

WHJl 

WKJ2 

WHJ3 

WJ1 

VMJ2 

WMJ3 

WMJ1 

WMJ2 

WMJ3 

WHJ1 

WHJ2 

WHJ3 

WHJ1 

WMJ2 

WHJ3 

NNJ1 

VKJ2 

VHJ3 

VIU1 

m2 

WHJ3 

viui 

VNJ2 

WU3 

WU1 

VKJ2 

WU3 

WMJ1 

WU2 

WHJ3 


SIGNAL  PEPTIDE  < - \/ - > 

GAGAAAGAGCAGAAGACAGTGGCAATGAGAGTGAAGGGGATCAGGAGGAATTGTCAGCACTTGTGGATATGGGGCACCATGCTCTTTGGGATGTGGATGATCTGTAGTGCTGTAGAACAA 

GluLysGluGlr\LysThc7alAlsrt»tAcgVslLysGlyIl*ArgArgAsnCysGiniUsL*uTcpn*TrpaiyTh£.l«tL«uPh#ClyfUtTrpH«tIl#CysS«rAliValGluGln 

N«t 


120 


Lau 

EXTRACELLULAR  REGION 

TTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAAAGAAGCAACCACCACTCTATTCTGTGCATCGGATGCTAAAGCATATAGTACAGAGGCACATAAGGTCTGGGCCACACATGCC 

LauTrpv*]>ThrV«lTycTyrGly7*£ProV«lTrpLysGluAUThrThrThrLaMPh*Cy*AlaSacAspAlaLysAl*TyrS#rThrGluAl*HisLy»ValTryAlaThrHxsAla 

Asn 

Aip  Asn 

TOTGTACCCACAAACCCCAATCCACAAGAAGTAGTATTGGAAAATGTGACAGAAAATTTTAACATGTGGAAAAATAATATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGAT 
C7jV«lProThrA»n?coA»nProGlnGluV»lv»H.*uGluA»nV*l?hrGluAinPh«AsnM«cTrpLyjA«nAjnff»cv*l51uGlnM«cHi»GluA»pIl«Il«s*rL«uTrpA»p 
— — ~--G— - — — — — — — — —  A— — — G- -  - 


Aip 


Gly 

G- 


Gly 


CAAAGCCTAAAAC  CATGTGTAAAATTAACCCCACTCTGTGTTACTTTAAATTGCATTGATAAGAACATTACTGATTGGGAGAATAAAACAATAATAGGAGGAGGAGAAGTAAAAAACTGC 
QlnSagLauLysProCysValLysLauThcPgoLauCysVslThcLauAanCysIlaAipLysAsnnaTh  rAspTcpGluAsnLpThr  IlsIlsGlyGlyOlyGluValLysAsnCys 


I  la 


Lys  Thr 

— C - - - 

The 


■A-»G~~~~ 

Arg 


TCTTTCAATATCArCACAAGCATAAGACATAAGGTGCATAAAGAATATGCACTTTTTTATAAACTTGATGTAGTACCAATAAAGAGTAATAATGACAGTAGTACATATAGATTGATACAT 
JafPhaAsnllathrThrSarllaAr  ijAspLysValHisLysGluTyrAlsLauPhaTy  rLysLauAspVslvalProZ  laLysSarAsnAsnAspSarSarThrTy  rArgLsuIlsHis 

Arg  GlyAsp  Asn  Atg  Asn 

Asn 

TCTAATACCTCAGTCATTACACAGGCCTGTTCGAAGGTATCCTTTGAACCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAGTGTAATGATAAGAAGTTCAATGGA 
CysAsnThrSarVaiZlsThrGlnAlaCysSacLysValSarPhsGluProClaProIlsHLsTyrCysAlaProAlsGly  PhaAlallaLauLysCysAsnAspLysLysPhaAsnGly 


Pro 

-C-A- 

Pto 


ACAGGACCATGTACTAAT5TCAGCACAGTACAATGTACACATGGAATTAGACCAGTAGTGTCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGATAGTAATTAGATCTGAA 
ThcGiyProCysThrAsnValSarThcValGlnCysTarHxsGlyllaArgProValValSacThtGlnLauLauLauAsnGlySacLauAlaGluGluGluI  laValllaAcgSacGlu 


Ila 


AATTTCACAGACAATGCTAAAACCATAATAGTACACCTGAATGAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATGTAAGAAGAAGACATATACATATAGGACCAGGGAGAGCA 

AsnPhaThrAspAsnAlaLysThrIlaIla7alHisLanAsnGluSacVslGiuIl»AsnCysThcAggProAsnAsnAsnV«lAcgArgArgHxsIlaHisIlaGiyProGlyArgAla 

Tyt  SarLauSar 

- - - - - — G - T — C -  - G - 

AspIlaAls  Arg 

rrTTATACAGGAGAAATAAGAGGAAATATAAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGTTGAGAAATTAAGAGAACAATTTAAGAATAAA 
PhaTyrThTGlyGluIlaArgGlyAsnllaArgGlaAlsHisCysAsallaSarAf  gAlsLysTrpAsnAsnThrLauLysGlnllaVslGluLysLauArgGluGlnPhaLyiAsnLys 

Arg  Arg  Ila  Zla 

- a - r - - - 

Lys  Ila 

ACAATAGTCTTTAATCATTCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTGAITCAACACAACTGTTTAATAGTACTTGGAATGTT 
ThrllaValPhiAsnXisSarSsrGlyGlyAapProGluIlsValThrHisSirPhaAsnCysGlyGlyGluPhaPhiTyrCysAspSarThrGlnLauPhiAanSarThrTrpAsnVal 
-  --- - - - - — - -  •  - - - - - — - - A — * — — — . . — — 0- 

Gly 


1080 


1200 


Asn 

— 1 — — - -- - — TC — - - - - ■- - - — - - — a- - A - 

Sar  AsnThr  Lys 

ACTOGCATTGAAGGAAATAATAACACTGAAGAAAATATCACACTTCCATGCAGAATAAAACAAATTATAAACATGTGGCAGGGAGTAGGCAAAGCAATGTATGCCCCTCCCATCGCAGGA 
ThrQlyIlaGl7GlyAanAsnAsnThrGluGluA<nIlaThgLiuProCyiArgll«LysGlnIlaIlaAsnMitTrpGlnGly7t  lGlyLysAlaHatTyrAlaProProIlaGlYGly 


Asp 


Lys  Asp  LysAsnSarThrLau 


Gin 

-A-C- 

Sar 


CAAArrAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAGCAGCAGGGAAGAGATCTTCAGACCTGGAGGAGGAAATATGAGGGACAATTOGAGAAGT 

QlnllaArgCysSarSacAsnllaThcGlyLauLauLauThrArgAspGlyGlyAsnSarSarSarArgGluGluIlaPhaArgProGlyGlyGlyAsnJIatArgAspAsnTrpArgSac 


1440 


Figure  8.  Nucleotide  and  amino  acid  sequence  alignment  of  the  entire  envelope  genes  of  three  sequential 
virus  isolates  from  patient  WMJ.  Molecular  clones  of  WMJ-1,  WMJ-2,  and  WMJ-3  [designated  A.WMJ-1  (V),  XWMJ- 
2  (III),  and  >.  WMJ-3  (III),  respectively]  were  obtained  and  their  envelope  genes  sequenced.  Alignment  of  the 
sequences  was  performed  pairwise  with  the  assistance  of  NUCALN.  Sequences  are  numbered  according  to  WMJ- 
1  starting  with  the  first  nucleotide  following  a  TAA  stop  codon.  Genomic  regions  corresponding  to  the  signal  peptide, 
the  extracellular  glycoprotein  (gp120),  and  the  membrane-associated  glycoprotein  (gp41)  are  shown.  Dashes  indicate 
nucleotide  identify  and  spaces  indicate  deletions  in  WMJ-2  and  WMJ-3  with  respect  to  WMJ-1. 
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Figure  8  (cont.) 


EXTRACELLULAR  REGION  < -  \/ - >  TRA5S«EMBRAifE  REGIOV 

GAArrATATAAATATAAAGTAGTAAAAArrCAACCAT?AGGAGTAGCACCCACCAAGGCAAAGACAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAGCAATAGGAGCTATCTTCCTr  15(0 
GluLeuTYrLysTyrLysVaiyslLysXleGluProLsuGlyVelAleProThgLysAleLysAggAggVeiyalGlaAggGluLyaAggAlsYslGlyAlsIleGlyAlsrte  tPheLau 


GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCSTCACTGACGCTGACaGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAACAGCAGAACAATTTGCTGAGGGCTATTGAG  1(10 
GlyPheLeuGlyAlaAlaGiySegThgrtetGiyAleAleSegLeuT&gLeuThg'/sXaiftAleArgOinLeuLeuSegGlylieVelGlaGlaGlAAjnAJaLeuLeuArgA.lAXleGlii 

-■  - — — — — —OC— — — -»-■  *  -  - — — — — — ~ - — — - — —  ■  - — — — -  -  -  -  "■  --"T 

Gly  Asp 


aCGCAACAGCATCTGTTGCAACTCACGGTCTGGGGCATCAAACAGCTCCAGGCAAGAGTCCTGGCTGTGGAAAGATACCTAAGGGATCAACAGCrCCTAGGGArrTGGGGTTGCrCTGGA  1100 
AlsGlnGlnHisLeuLeuGlnLeuThgVelTgpGly  IleLysGlnLeuGlaAisAggVslLeuAle  VslGluAcgTygLeuArgAspGlnGlnLeuLeuGlylleTgpGlyCysSegGiy 


AAACTCATTTGCACCACTACTGTGCCTTGGAACGCTAGTTGGAGTAATAAATCTATGGATCAAATTTGGAATAACATGACCTGGATGGAGTGGGAAAGAGAAATTGACAATTACACAAGC  1920 
LysLsuXlsCysThgTagThgVslPgoTgpAsaAlsS#  gTgpSe  cAinLyaS#  cMstAspGlallsTtpAsnAstiKscThgTgpHetGluTrpGluArqGim  IsAs  pAsaTyrThrSeg 


LeuAsa  As  p  Leu  Glu  Asa 

TTAAT AT ACAACTT AATTG AAG AATCG CAG AAC CAGCAAGAAAAGAATGAACAAGAATTATTAGAATTGGATAAGTGGGCAAGTCTGTGGAATTGGTTTTCCATAACAAACTGGCTGTGG  2040 
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Figure  9.  Computer  graphic  illustrating  relative  variation,  or  divergence,  in  the 
envelope  genes  of  five  independent  AIDS  virus  isolates  and  the  position  of 
amino  acid  differences  among  WMJ-1,  WMJ-2,  and  WMJ-3.  The  five  independent 
isolates  were  HTLV-lllb  (BH-10),  LAV  (LAV-1  a),  ARV  (ARV-2),  HTLV-IIIRF  (HAT-3),  and 
HTLV-III^,  (WMJ-1).  Their  relative  degrees  of  variation  were  calculated  and  plotted 
on  the  ordinate  from  0  (minimum  divergence)  to  10  (maximum  divergence).  Beneath 
the  abscissa  are  single  letter  codes  for  conserved  amino  acids  as  defined  by  identity 
in  five  out  of  five  or  four  out  of  five  of  the  sequences.  Arrows  beneath  this  relative 
consensus  sequence  indicate  the  position  of  amino  acid  changes  in  WMJ-2  and  WMJ- 
3  compared  to  WMJ-1.  Double  arrows  represent  identical  amino  acid  substitutions  in 
WMJ-2  and  WMJ-3  compared  to  WMJ-1  while  boxed  double  arrows  indicate  different 
amino  acid  substitutions  Numbering  of  amino  acids  is  based  on  the  envelope 
sequence  as  shown  on  the  abscissa.  Note  that  most  of  the  amino  acid  differences 
among  the  three  WMJ  isolates  are  confined  to  regions  of  either  high  or  intermediate 
envelope  variability. 


Figure  10.  Restriction  endonuclease  cleavage  patterns  for  7  independent  HIV- 
1  viral  clones  (HXB2;  LAV- Mai;  LAV-Eli;  ARV2;  WMJ1;  RJS4a;  WMFIa)  and  for 
17  distinguishable  clones  from  isolate  RJS4,  10  different  clones  from  isolate 
WMF1,  and  13  distinct  clones  from  isolate  WMF3.  To  the  right  of  each  set  of 
restriction  maps  is  a  table  summarizing  the  different  restriction  enzyme  patterns  for 
each  representative  clone  within  that  set  (E-EcoRI;  S-Sstl;  X-Xhol;  B-Bglll;  U-Pvull; 
H-Hindlll;  P-Pstl;  K-Kpnl;  M-BamHl;  C-Cvnl;  A-Xbal).  The  top  clone  pattern  in  each 
set  is  arbitrarily  designated  11111111111  and  differences  in  subsequent  clone  patterns 
are  identified  sequentially.  The  laboratory  clone  (e.g.,  RJS4.1;  RJS4.11;  RJS4.16; 
etc.)  designation  and  the  total  number  of  viral  clones  represented  by  each  pattern  are 
also  indicated. 
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Figure  11 


Figure  11.  Analysis  of  variation  among  viral  DNA  clones  of  HIV-1.  Each  of  the 
45  different  viral  clone  patterns  depicted  in  Figure  1 0  was  compared  pairwise  to  every 
other  clone  pattern  and  the  percentage  difference  in  restriction  endonuclease  cleavage 
sites  between  each  pair  was  calculated  as  follows: 

A  j.  R 

— . —  -=  X  1 00  =  %  Restriction  Site  Differences 

c 

where,  A  equals  the  number  of  restriction  sites  present  in  one  clone  (X)  that  are 
missing  in  the  other  clone  (Y);  B  equals  the  number  of  restriction  sites  present  in 
clone  Y  that  are  missing  in  clone  X;  C  equals  the  total  number  of  restriction  sites 
present  in  clones  X  and  Y  combined,  with  identical  sites  in  the  pair  counted  only 
once.  For  example,  if  clone  X  and  clone  Y  had  25  restriction  sites  in  common  and 
no  additional  sites,  they  would  have: 

Q  +  ^  X  100  =  0%  Restriction  Site  Differences 

25 

If  clones  X  and  Y  each  had  25  restriction  sites  but  none  were  in  common, 
they  would  have: 


25  +  25  x  100  *  100%  Restriction  Site  Differences 

50 

This  analytic  approach  allowed  each  of  the  45  distinct  clone  patterns  to  be  compared 
with  every  other  clone.  Because  EcoRI  and  Sstl  enzymes  were  used  to  clone  the 
RJS  and  WMF  viruses,  respectively,  these  enzyme  sites  were  not  included  in  the 
analysis  of  variation  among  the  RJS4,  WMF1,  and  WMF3  clones.  Because  the  two 
terminal  Sstl  sites  in  all  clones  except  WMFIb,  c,  h,  i,  and  j  actually  represent 
redundant  sequences  in  their  LTRs,  only  one  of  the  two  was  included  in  the  analysis 
of  variation.  Thus,  the  first  pair  of  clones  compared  in  Table  1  (HXB2  versus  LAV- 
Mai)  differ  by  a  total  of  27  out  of  43  restriction  sites,  or  63%. 

^  T  -  0  X  100  -  63%  Restriction  Site  Differences 

43 

Boxed  in  solid  lines  are  the  difference  scores  for  viral  clones  derived  from  the  RJS4, 
WMF1 ,  and  WMF3  isolates.  The  dashed  lines  highlight  the  comparison  of  viral  clones 
between  isolates  WMF1  and  WMF3,  both  of  which  were  derived  from  the  same 
chronically  infected  individual  16  months  apart.  In  order  to  compare  statistically  the 
extent  of  dissimilarity  among  clones  from  within  individual  isolates  (RJS4a-q;  WMFIa- 
j;  WMF3a-m)  versus  clones  from  independent  (unrelated)  isolates  (HXB2,  LAV-Mal, 
LAV-Eli,  ARV2,  WMJ1),  the  individual  data  points,  not  mean  percentages,  of  the 
respective  groups  were  compared  by  chi  square  analysis. 
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Figure  12.  Southern  blot  restriction  enzyme  cleavage  patterns  of  parallel  and 
sequential  isolates  of  HIV-1  (Panels  A  and  B)  and  of  cultured  versus  uncultured 
virus  (Panel  C).  Panel  A,  lanes  1  and  2  represent  viral  isolates  obtained  by  co¬ 
cultivating  a  single  (split)  sample  of  PBMC  from  an  HIV-1  infected  subject  (RJS)  with 
PHA-sti mutated  lymphocytes  from  two  different,  HLA-unrelated  normal  donors.  Panel 
B,  lanes  1  and  2,  are  the  same  as  lanes  1  and  2  from  panel  A.  Lanes  3  and  4 
correspond  to  virus  isolates  derived  from  the  PBMC  of  this  same  HIV-1  infected 
subject  taken  2  weeks  and  4  weeks  after  the  first  phlebotomy  and  co-cultured  again 
with  PBMC  from  two  other  unrelated  normal  donors.  The  right-hand  blot  in  panel  B 
is  identical  to  the  center  blot  except  that  the  exposure  time  was  lengthened  from  two 
to  five  days  in  order  to  identify  faint  signals.  Notice  that  the  predominant  Hindlll 
patterns  of  isolates  3  and  4  are  distinguishable  from  isolates  1  and  2,  yet  with  long 
exposure  films  persistence  of  common  viral  genotypes  can  be  identified  in  all  isolates. 
Panel  C,  Southern  blot-hybridization  patterns  of  HIV-1  DNA  extracted  directly  from 
uncultured  brain  tissue  (lanes  2)  and  from  HIV-1  viral  cultures  derived  by  lymphocyte 
co-cultivation  with  the  same  brain  tissue  specimen  (lanes  4).  Lanes  1  show  an  HIV- 
1  positive  control  DNA  from  an  unrelated  virus  isolate  and  lanes  3  represent 
uninfected  negative  control  DNA. 
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Figure  13.  Amino  acid  sequences  of  the  entire  envelope  genes  of  seven  HIV  isolates,  WMJl ,  WMJ2,  and 
WMJ3  are  sequential  isolates  from  the  same  patient:  BHIO,  ARV2,  LA  Vi  -a,  and  HAT-3  represent  independent 
virus  isolates.  Alignment  of  the  sequences  was  done  with  the  assistance  of  PRTLAN.  Numbering  of  amino  acid 
sequences  is  done  from  the  first  residue  of  the  envelope  protein  open  reading  frame  of  WMJl  up  to  the  end. 
Dashes  indicate  amino  acid  identity  with  WMJl ,  spaces  indicate  the  absence  of  that  amino  acid.  N-glycosylation 
sites  are  indicated  as  shaded  areas,  amino  acids  in  p  turn  regions  are  encircled.  Symbols:  *,  cysteine  residues; 
—  ,  conserved  regions; ✓'n^v  variable  regions. 


78 


Figure  14.  Chou-Fasman  prediction  of  the  envelope  protein  gp160  derived  from  the  sequence  of  WMJ1 . 
The  computer  plots  start  at  the  methionine  at  residue  9  of  the  open  reading  frame.  The  probability  of  the 
occurrence  of  a-helices  p -pleated  sheets  ( ),  random  coils  and  (3  turn  regions  (CO), 

were  evaluated  by  using  stringent  conditions.  The  parameters  for  hydrophilicity,  flexibility,  and  surface 
probability  were  averaged  over  five  amino  acid  residues,  with  a  limit  of  0.7  for  hydrophilicity,  1 .040  for 
flexibility,  and  5.0  for  surface  probability.  Symbols:  •- ,  N-glycosylation  sites;—,  conserved  regions; 
variable  regions.  (A)  Secondary  structures  superimposed  with  the  values  for  hydrophilicity;  0,  hydrophilic 
regions,  O,  hydrophobic  regions.  (B)  Secondary  structures  superimposed  with  the  values  for  flexibility;  0. 
flexible  regions.  (C)  Secondary  structures  superimposed  with  the  values  for  surface  probability;  0 ,  high 
surface  probability. 
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Figure  15.  Parameters  and  values  for  conserved  and  variable  regions, 
calculated  from  the  sequences  of  seven  HIV  envelope  proteins  (gp  160)  (see 
Figure  13). 
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Figure  16.  Predicted  antigenic  epitopes  in  seven  strains  of  HIV-1. 
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Figure  17.  Nucleotide  sequence  differences  between  molecular  clones  of  HTLV- 
4,  3TLV-3AOM  and  STLV-3(SIV)MAC 
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Figure  18.  Distribution  of  amino-acid  sequence  differences  in  the  envelope 
genes  of  STLV-3AQM>  STLV-3  (SIV)^  and  HIV-2  compared  to  HTLV-4.  Vertical 
lines  represent  the  position  of  single  amino-acid  sequence  differences  in  the  envelopes 
of  STLV-S^KSW,  STLV-3.0M/K78,  SIV^/Mml 42-83,  and  HIV-2R00  compared  with 
HTLV-4/PK82,  as  determined  by  pairwise  alignment  using  the  program  PRTALN. 
Arrows  depict  the  position  of  amino-acid  insertions  and  deletions.  An  asterisk  depicts 
the  location  of  a  TAG  in-frame  stop  codon  in  the  transmembrane  region  which  is 
present  in  the  clones  of  HTLV-4,  STLV-3AGM  and  STLV-3  (SIV)MAC,  but  not  in  HIV- 
2,,00.  SP,  signal  peptide;  ECR,  extracellular  region;  TMR,  transmembrane  region. 
Relative  amino-acid  positions  are  shown  on  the  bottom  scale. 
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Figure  19.  Determination  of  relative  nucleotide  sequence  homology  of  a  newly 
isolated  Senegalese  retrovirus  (isolate  ST)  to  HIV-2„00  (panel  a)  and  HTLV- 
4/STLV-3  (panel  b).  Panels  a  and  b  are  identical  except  for  the  viral-specific  nucleic 
acid  probe  used.  Lane  1 ,  HTLV-4/PK82;  Lane  2,  normal  human  DNA;  Lanes  3-6,  4 
different  subcultures  derived  from  the  parental  ST  virus  isolate.  Preferential 
hybridization  of  the  HIV-2  specific  probe  to  ST  isolate  DNA  is  seen  (panel  a,  lanes 
3-6).  The  HTLV-4/STLV-3  specific  probe  hybridizes  much  less  efficiently  to  ST  DNA 
(panel  b,  lanes  3-6)  but  hybridizes  to  HTLV-4  DNA  on  the  same  filter  (panel  b,  lane 
1).  Lanes  5  and  6  (panel  a)  show  two  distinct  ST  viral  genotypes  polymorphic  in  a 
single  SamHI  restriction  site.  Virus  cultures  (lanes  3  and  4)  contain  a  mixture  of 
these  two  predominant  viral  forms  which  also  differ  in  their  Xba\  restriction  patterns 
(data  not  shown).  Size  markers  are  given  on  the  left  in  kilobases  (kb) 
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Figure  20.  Genetic  characterization  of  HIV-2/ST  by  differential  nucleic  acid 
hybridization,  restriction  enzyme  analysis  of  HIV-2/ST  DNA,  and  comparison  of 
full-length  HIV-2/ST,  HIV-2/ROD,  and  SIV,^  proviral  DNA  clones.  Identical 
nitrocellulose  filters  containing  agarose  gel  separated  Bam  HI  restriction  digests  of 
vi rally-infected  cellular  DNA  (10ug  each)  were  prepared  (B).  HIV-1/lllb  (lanes  1); 
HIV-2/ROD  (lanes  2);  SIV^-251  (lanes  3);  normal  human  lymphocyte  DNA;  HIV-2/ST 
(cell  line  SupTI/LKOOl)  (lanes  5);  and  HIV-2/ST  (cell  line  Hut78/Bl2)  (lanes  6). 
Hybridization  was  to  a  “P-labelled  4.3  kb  Pst  l-Pst  I  (poAcentral  regio n-env  fragment 
of  HIV-2/ROD  probe  (Fig.  1C)  and  filters  were  washed  at  low  (3  x  SSC,  0.2%  SDS, 
55°C)  or  high  (0.1  x  SSC,  0.2%  SDS,  65°C)  stringency.  DNA  blot-hybridization 
restriction  cleavage  analysis  with  the  same  probe  as  in  (B).  SupTI/LKOOl  and  Hut 
78/BI  2  are  permanent  producing  HIV-2/ST  infected  cell  lines,  the  latter  derived  by 
single  cell  cloning  of  a  primary  Hut  78  infected  line.  Restriction  enzymes  were  Xba 
I  (lanes  1),  Xho  I  (lanes  2),  Bam  HI  (lanes  3),  Eco  Rl  (lanes  4),  Hinc  II  (lanes  5),  and 
Kpn  I  (lanes  6).  (A)  Restriction  maps  of  three  full-length  recombinant  proviral  DNA 
clones  of  HIV-2/ST  (JSP4-27;  JSP4-32;  JSP4-34)  obtained  by  Sau3AI  (Mbo  I)  partial 
digestion  of  Hut  78/BI  2  DNA  and  ligation  into  J1  -lambda  phage.  Restriction  enzymes 
shown  are  Pst  I  (P),  Bam  HI  (B),  Xho  I  (O),  Bgl  II  (G),  Eco  Rl  (E),  Hind  III  (H),  Xba 
I  (X),  Sal  I  (L),  and  Sst  I  (S).  Asterisks  denote  sites  in  HIV-2/ROD  or  SIV„AC  that  are 
also  present  in  the  HIV-2/ST  clones.  Dotted  lines  denote  unique  flanking  cellular 
sequences  in  the  HIV-2/ST  clones. 
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Figure  21.  (A)  Western  immunoblot  and  (B  and  C)  RIP  analysis  of  HIV-2/ST, 
HIV-2/ROD,  SIV^c,  and  HIV-1/lllb.  (A)  Immunoblots  were  performed  using  a  human 
(West  African)  antiserum  to  HIV-2  (lanes  1-8)  and  a  human  antiserum  to  HIV-1  (lanes 
9-12).  Bound  antibody  was  detected  with  peroxidase-conjugated  goat  antibody  to 
human  immunoglobulin.  Antigen  preparations  were  viral  lysates  from  HIV-2/ST  line 
SupTI/LKOOl  (lanes  1,  12);  single  cell  derived  clones  from  SupTI/LKOOl  designated 
ST.  17  (lane  2),  ST.9  (lane  3),  ST.24  (lanes  4,  11);  HIV-2/ROD  (lanes  5,  10);  SIV,^ 
(lanes  6);  HIV-1/lllb  (lanes  7,  9);  uninfected  control  cells  (lane  8).  (B)  RIP  of 

HIV-2/ST  env  proteins  from  cloned  cell  lines  of  infected  SupTI  cells  shown  in  (A). 
Cell  lysates  from  [“Sjcysteine  and  [“Sjmethionine  labeled  clones  were  prepared  and 
immunoprecipitated  with  either  normal  human  serum  (lanes  1,3,5,  7)  or  serum  from 
a  West  African  patient  with  HIV-2  infection  (lanes  2,  4,  6,  8).  Shown  are  clones 
designated  ST.9  (lanes  1  and  2),  ST.17  (lanes  3  and  4),  ST.23  (lanes  5  and  6)  that 
have  an  env  precursor  molecule  of  approximately  180  kD  and  ST.24  (lanes  7  and  8) 
which  has  an  env  precursor  molecule  of  170  kD.  The  smaller  sized  env  precursor 
(gp170)  of  ST.24  corresponds  to  the  truncated  transmembrane  protein  (gp30)  of  the 
same  virus  (Fig.  1A).  (C)  Co-precipitation  of  HIV-2/ST  envelope  glycoproteins  with 
CD4.  Cell  lysates  from  [“Sjcysteine  and  [^Sjmethionine-labeled  uninfected  SupTI 
cells  (lanes  1-5)  and  HIV-2/ST  infected  SupTI/LKOOl  cells  (lanes  6-10)  were  prepared 
and  immunoprecipitated  with  OKT3  (lanes  1  and  6),  OKT4  (lanes  2  and  7),  OKT4A 
(lanes  3  and  8),  normal  human  serum  (lanes  4  and  9),  or  human  serum  from  a 
patient  with  HIV-2  infection  (lanes  5  and  10).  Both  OKT4  and  OKT4A 
immunoprecipitate  the  56kD  CD4  molecule  from  the  uninfected  cells.  However,  in  the 
infected  cells  OKT4  (but  not  OKT4A)  immunoprecipitates  CD4  as  well  as  three 
molecular  species  of  high  molecular  weight  corresponding  to  viral  env  gene  products 
that  are  also  immunoprecipitated  by  immune  human  serum  (lane  10). 
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Figure  22.  Syncytium  formation  by  HIV-2/ST,  HIV-2/ROD,  and  HIV-1/lllb  infected 
cells  and  CD4*  SupTI  indicator  ceils.  Virally-infected  SupTl  cells  were  mixed  1:5 
with  uninfected  SupTI  cells,  incubated  24  hours,  and  photographed.  Results  were 
identical  using  Hela-T4,  H9,  and  Hut  78  as  uninfected  indicator  cells  and  when  using 
HIV-2/ST  infected  H9  and  Hut  78  as  env  expressing  effector  cells.  The  HIV-2/ST  cell 
line  (ST/LK001)  and  single  cell  derived  clones  (ST.9  and  ST.24)  are  described  in  the 
legend  to  Fig.  21. 


Figure  23.  Comparison  of  cyto  path!  e/cell  killing  proportion  of  HIV-2/ST  with  HIV-2/ROD  and  with  HIV-1  Isofotos  lltb,  BC,  WMJ, 
RH.  Each  panel  summarizes  a  21  day  experiment  in  which  equivalent  amounts  of  ceii-lree  virus  (250,000  cp.m.  RT  activity)  were  applied 
to  cultures  ol  uninfected  SupTl  cells  (A,  B)  or  PHA-stimulaled  peripheral  blood  lymphocytes  (Panels  C,  D).  In  Panel  E,  7,500  cpm.  RT 
activity  of  all  viruses  were  used  except  for  HIV-2/ST  10X  (75,000  c.pm.)  and  HIV-2/8T  100X  (750,000  c.p.m.).  Shown  are  supernatant 
reverse  transcriptase  activities;  percentages  of  cells  infected  with  virus  as  determined  by  indirect  immunofluorescence;  and  cell  killing 
assessed  by  viable  cell  counts  with  trypan  blue  exclusion,  hemocytometry,  and  automated  cel  counting.  Identical  culture  splits  were  done 
weekly  to  keep  the  concentration  of  control  cells  between  0.5  x  10*  and  1.0  x  lO'/ml  and  ceil  killing  was  expressed  logarithmically  as 
decreases  in  viable  ceil  numbers.  Syncytia  were  scored  from  0  (none)  to  4+  (extensive)  and  are  shown  in  brackets  (<  >).  Mock  infected 
culture  supernatants  lacking  virus  served  as  controls.  Ten  percent  (w/v)  final  concentration  of  polyethylene  glycol  (PEG)  was  used  to 
concentrate  virus  for  experiments  shown  In  Panels  B,  0,  and  E. 
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Figure  24.  Time  course  of  viral  infection  by  HIV-1  Hlb,  HIV-2R0D,  and  HIV-2ST 
assessed  by  in  situ  hybridization.  The  relative  amount  of  viral  mRNA  production 
per  cel!  was  scored  qualitatively  from  silver  grain  densities:  absent,  to  ++++,  too 
numerous  to  count.  Although  there  was  a  marked  delay  in  the  initial  development  and 
spread  of  productive  viral  infection  by  HIV-2ST,  the  few  cells  that  became  productively 
infected  at  early  time  points  expressed  equal  amounts  of  viral  RNA  on  a  per  cell  basis 
compared  with  prototype  HIV-1  and  HIV-2 
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Figure  25.  Alignment  of  the  deduced  X  amino  acid  sequence  of  HIV-2  and 
SIV^.  Shown  are  eight  open  reading  frames  known  to  encode  HIV  proteins  as  well 
as  the  novel  X  open  reading  frame.  A  comparison  of  the  deduced  X  amino  acid 
sequence  is  shown  below  for  HIV-2  {isolate  ROD)  and  SIV,^  (isolate  PK82).  Vertical 
bars  indicate  sequence  identity.  The  first  19  residues  (underlined  in  the  HIV-2  and 
SIV^  X  sequence)  represent  the  oligopeptides  which  were  synthesized  to  raise  rabbit 
anti-X  immune  sera. 
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Figure  26.  Prokaryotic  expression  of  the  SIVMAC  X  protein.  (A)  Construction  of 
pATH-X.  The  prokaryotic  expression  vector,  pATH,  was  used  to  express  the  entire 
SIVMAC  X  open  reading  frame  minus  the  first  two  amino  acid  residues.  A  422  base 
pair  Xholl/Hindlil  fragment  of  SIV^  clone  PKE102  was  inserted  into  pATH  at  the 
BamHI  and  Hindlll  sites  as  depicted.  This  placed  the  SIVMAC  orf  inframe  and 
downstream  of  the  bacterial  Trp-E  gene  to  allow  its  expression  as  a  50kD  fusion 
protein.  A  unique  Nrul  and  EcoRI  site  were  subsequently  used  to  remove  95%  of  the 
Trp-E  coding  region,  leaving  only  16  TrpE  derived  amino  acids  on  the  5’  end  of  the 
expressed  SIVMAC  X  gene.  This  construct  yielded  a  smaller  fusion  protein  of  15kD. 
The  position  of  the  trp  promoter/operon  within  the  construct  is  indicated.  (B)  Western 
blot  analysis  of  bacterially-expressed  SIVMAC  X  proteins.  E.  coli  cell  lysates 
transformed  with  non-recombinant  vector  pATH  (lanes  1),  pATH-X  (lanes  2)  and  the 
TrpE  deleted  pATH-X  construct  (lanes  3)  were  tested  by  immunoblot  analysis  for 
reactivity  to  rabbit  sera  immunized  with  SIVMAC  and  HIV-2  X  peptides  (panels  B  and 
D,  respectively)  as  well  as  to  preimmune  rabbit  sera  (panels  A  and  C).  A  50kD 
TrpE/X  and  a  15kD  delta  TrpE/X  fusion  protein  were  recognized  only  by  X  peptide 
immune  sera.  (A  smaller  band  of  approximately  40kD  represents  a  prematurely 
truncated  version  of  the  50kD  TrpE/X  protein.)  An  antiserum  raised  against  the 
bacterial  TrpE  gene  product  (panel  E)  was  used  as  control  to  confirm  the  size  of  the 
50kD  TrpE/X  fusion  protein  (lane  2)  and  to  identify  the  non-recombinant  37kD  TrpE 
protein  (lane  1).  This  antiserum  failed  to  detect  the  TrpE  deleted  15kD  X  protein 
(lane  3).  Antigen  antibody  complexes  were  detected  using  125l  labelled  protein  A. 
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Figure  27.  Identification  of  the  HIV-2  and  SIV^  X  protein.  (A)  Immunoblot  of 
viral  preparations  probed  with  X  peptide  immune  and  preimmune  sera.  HIV-1  (isolate 
lllb,  lanes  1),  HIV-2  (isolate  ST,  lanes  2),  and  SIVMAC  viral  lysates  (isolate  PK82,  lanes 
3),  and  lysates  of  uninfected  cell  culture  supernatants  (lanes  4)  were  separated  on  a 
12.5%  polyacrylamide-sodium  dodecyl  sulfate  gel,  eiectrophoretically  transferred  to 
nitrocellulose,  and  reacted  to  the  following  sera  diluted  1:100  in  PBS  containing  5% 
nonfat  dry  milk  and  0.1%  Tween  20:  human  anti-HIV-1  positive  serum  (RR);  human 
anti-HIV-2  positive  serum  (ST);  rabbit  SIV  X  peptide  preimmune  (A)  and  immune 
serum  (B);  rabbit  HIV-2  X  peptide  pre-immune  (C)  and  immune  serum  (D).  Bound 
antibody  was  detected  by  reaction  with  horseradish  peroxidase  conjugated  anti-rabbit 
and  anti-human  immunoglobulins  using  diamianobenzidine  as  a  substrate.  (B) 
Immunoblots  of  sucrose  banded,  purified  HIV-2  virions.  HIV-2  viral  supernatants 
(isolate  ST)  were  clarified  by  low-speed  centrifugation,  virus  was  pelleted  through  a 
20%  sucrose  cushion  and  then  banded  in  a  20%  to  60%  continuous  sucrose  gradient. 
Immunoblots  were  prepared  and  identical  strips  containing  purified  HIV-2  virions  as 
antigen  (2')  were  reacted  with  an  anti-HIV-2  positive  human  serum  (ST)  and  the  rabbit 
HIV-2  X  peptide  immune  sera  (D).  (C)  Competitive  adsorption  assay  using  bacterially 
expressed  SIV^  X  protein  to  adsorb  anti-X  antibodies.  Bacterial  lysates  of  pATH- 
X  transformed  E.  coli  immobilized  on  nitrocellulose  were  incubated  with  SIV^  and 
HIV-2  X  peptide  immune  sera  (panel  B’  and  D’,  respectively).  Preadsorbed  sera  were 
subsequently  tested  by  Western  blot  analysis  for  reactivity  with  preparations  of  HIV-2 
(isolate  ST,  lanes  2)  and  SIVMAC  (isolate  PK82,  lane  3). 
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Figure  28.  Analysis  of  human  sera  for  reactivity  with  the  SIV^c  X  proteins 
expressed  in  E.  coll.  Cell  lysates  transformed  with  pATH  vector  (lanes  1),  pATH- 
X  (lanes  2)  and  the  TrpE  deleted  pATH-X  construct  (lanes  3)  were  separated  on  a 
12.5%  acrylamide  sodium  dodecyl  sulfate  gel,  transferred  to  nitrocellulose  and 
analyzed  by  immunoblotting  for  reactivity  to  four  HIV-2  antibody  positive  human  sera 
(a-d),  four  HIV-1  positive  human  sera  (e-h)  and  2  normal  control  sera  (i,  j).  Sera 
were  diluted  1:50  in  PBS  containing  5%  non-fat  dry  milk  and  0.1%  Tween  20.  Arrows 
denote  the  position  of  the  50kD  TrpE/X  and  the  15kD  delta  TrpE/X  fusion  proteins 
recognized  bv  two  of  four  HIV-2  positive  human  sera  (a  and  c).  Bound  antibody  was 
detected  by  a\  labelled  protein  A. 
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partial  Xbat  digest  X  JSF4-27 

tubcloning  into  pSP65 


— >R  .........  IQ 0  .  . 

AGTCGCTCTGCGGAGAGGCTGGCAGATTGAGCCCTGGGAGGTTCTCTCCAGCACTAGCAGGTAGAGCCTCGGTGTTCCCTGCTAGACTCTCACCAGTGCTTGGCCGGCACTCCGCAGACG 

.  R< - >U5.  .  200  .... 

GCTCCACCCTTGCTTCCTTAAAAGACCTCTTAATAAAGCTGCCAGTrAGAAGCAAGrTAACrCTCrGCTCCCArCrCrCC7AGrcCCCGCCrGGTCATrCCCrGrrCArcrAAACrAACA 
.  •  •  U5  < — I  PBS  ..... 

AGAecCTGGTCTGTTAGGACCCTTTCTCCTTTGGCAAACCAAGGCAGCAAAATCCCTAGCAGGTTGSCGCCCCAACACGGACTTGAAGAAGACTGACAACCCTTCGAACACGGCTGAGTG 

.  .  .  400  ....... 

AAGGCAGTAAGGGCGGCAGGAACAAACCACGACGGAGTGCTCCTAGAAAAGCGCAGGCCGAGGTACCAACGCCGGCGTGTCGACCGGGAGTGAAACAGGCCTCCGGGTGAACGTAAGTGC 

.  500  .........  600 

CTACACCAAATACAGTAGCCACAAGGGCTrGTTATCCTACCTTTAGACGCGTAGAAGArTGrGGGAGATGGCCGCGAGAAACrCCGTCrTGACAGGGAAAAAACCACACGAATTACAAAA 

gag  >  MatGlyAlaArgAsnSarValLauArgGly LysLy sAlaAspGl uLauGluLy 

700 

GArTAGGTTACGGCCCGGCGGAAAGAAAAAATATAGGCTAAAACATATTCTGTCGCCACCCAATGAATTGGACAGATTCCCATTGGCAGAGACCCTCTTCCAGTCAAAAGAGGCTTCCCA 
sIlaArgLauArgProGlyGlyLysLysLy sTyrArgLeuLysHia  XlaValTrpAlaAlaAsnGluLauAspArgPheGlyLauAlaGluSarLauLauCluSarLysGluGlyCyaGl 

800 

AAAAATTCTTACAGTTTTAGATCCATTAGTACCGACACGGTCAGAAAATTTAAAAAGCCTTTTTAAtACTCTCTGCCTCATTTCGTGTATACACGCAGAACAGAAAGCGAAACATACTGA 
nLyallaLauThrValLauAapProLeuVal ProThrGlySarGluA*nL«uLy*SarLauPh«AanThrValCysVaiIleTrpCy*IlaHi»AlaGluGluLyaAlaLysAf pThrGl 
.  .  .  .  .  900  ...... 

AGAAGCAAAACAAAAGGTACAGAGACATCTAGTGGCAGAAACAAAAACTACAGAAAAAATGCCAAGTACAAGTAGACCAACACCACCACCTAGCGGGAACGGAGGAAACTTCCCCGTACA 

uGluAlaLyaClnLyaVaiGlnArgHisLauValAlaGluThrLysThrthrCluLysMetProSerThrSarArgProThrAlaProProSarGlyAanGlyGlyAanPhaProValGl 

1000  ........ 

ACAAGTGGCCGGCAACTATACCCATGTGCCACTAAGTCCCCGAACCCTAAATCCTTGCGTAAAACTAGTAGAGGAAAACAACTTCGGGGCAGAACTAGTCCCAGGATTTCAGCCACTCTC 

nGlnValAlaGlyAanTyrThrHlaValProLeuSerProArgthrLauAanAlaTrpValLysLauValGluGiuLyaLysPhaGlyAlaGluValValProGlyPhaGlnAlaLauSa 

1100  .........  1200 

ACAAGGCTGCACGCCCTATGATATTAATCAAATGCTTAATTGTGTGGGCCACCATCAAGCAGCTATGCAAATAATCAGCGAAATTATTAATGAAGAAGCAGCAGATTGGGACGCACAACA 

rGluGlyCysThrProTyrAapIlaAsnGlnMetLauAanCysVaXGlyAapHisGlnAlaAlaMatGlnllallaArgGluIlallaAanGluGluAlaAlaAapTrpAspAlaClnKl 

1300 

CCCAATACCAGGCCCCTTACCAGCCGGGCAGCTCAGGGAGCCAAGGGGATCTGACATAGCAGGGACAACAAGCACAGTAGAAGAGCAGATCCAGTGGATGTTTAGGCCACAAAATCCTGT 
sProIlaProGlyProl«uProAiaGiyGlnLauArgGluProArgClySarA*pIlaAlaClyThrThrS«rThrValGluGluG!ftIlaGlnTrpMatPhaArgProGlnA*nProVa 
.  .  .  .  .  .  .  1400  .... 

ACCAGTAGGAAGCATCTATAGAAGATGGATCCACATAGGGC?ACAGAAGTGCCTCAGGATGTACAACCCAACCAACArCCTAGACATAAAACACGCACCAAAGGAGCCATTCCACAGTTA 
IProValGlySarHaTyrArgArgTrpI leGlnllaGlyLeuGlnLysCysValArgMatTyrAanProThrAsnl leLauAsp! laLyt GlnGlyP roLysGi uProPheGlnSarTy 
.  .  .  .  .  1500  .  ... 

tgtagatagattctacaagagcttgaggccagaacaaacagatccagcagtaaaaaattggatgacccaaacactgctagtgcagaatgccaacccagactctaacttagtactaaaagg 

rValAspArgPhaTyrLysSarLauArgAlaGluGlnThrAspProAlaValLysAsnTrpMatThrGlnThrLauLauValGlnAanAlaAsnProAspCysLysLeuValLauLysGl 

1600  ........ 

ACTAGGGATAAATCCTACCTTAGAAGAAATGCTAACCGCCrGTCAGGGGGTAGGTGGACCAGGCCAGAAACCCAGATTAATGGCAGAAGCCTTAAAGGAGCCCATGGCACCAGCCCCCAT 
y LauGly IlaAsnProThrLeuGluGluMet LauThrAlaCyaGl nGlyValGlycly ProGlyGlnLy sAlaAigLauMat AlaGluAlaLauLyaGluAlaMat AlaProAlaProIl 

1700  .........  1800 

CCCATTTGCAGCAGCCCAACACAGAAGCACAATTAACTGCTGCAATTGCGGAAAGGAAGGCCACTCGGCAAGACAATGCCGAGCACCTACAAGACAACGCTCCTGCAAATGTGCCAACGC 

pol  >  LysThrArgLauLauGiuMattrpGlnGly 

eProPhaAiaAlaAlaGlnGlnArgArgThr I XaLysCy *TrpAanCysGlyLy*GluGiyHi$S«rAlaArgGXrCy sArgAlaProArgArgGXnGlyCyaTrpLy sCy aClyLyaAl 

1900 

AGGACACATCATGGCAAAATGCCCAGAAAGACAGGCGGGTTTTTTAGGGTTGGGCCC^TGGGGAAAGAAGCCCCGCAATTTCCC7GrCGCCCAAArCCCGCAGGGCCTGACACCAACACC 
ArgThrHiiHliGlyLysMacProArgLya ThrGlyGXyPhaPhaArgVaXGlyProMacGlyLyaGluAlaProGlnPhaProCyaGlyProAanProAlaGlyAlaAspThrAanSar 
aGlyHi*HaMacAlaLy*CysProGluArgGlnAlaGlyPhaL«uGlyLBuGlyProTrpGlyLyaLy*ProArgAanPhaProValAlaGlnIleProGlnGlyL«uThr ProThrAl 
.  .  .  .  .  .  .  2000  .... 
acccccgatagacccagtagaggacctactacagaagtacatgcagcaaggcaaaaggcacagagaccagagagagaggccatacaaacaagtgacagacgacttcctgcagctcgacaa 

ThcProAspArgProSarArgGlyProThrArgGluValHisAlaAlaArgGluLysAlaGluArgAlaCluArgGluAlal laGlnArgSarAapArgGly LauProAlaAlaArgGlu 
aPcoProIlaA«pProVaiGluAspLeuL«uCluLysryrM«CGinGlnGlyLy*ArgCinArgGluCinArgGluArgProryrLy*GluValThrGluA*pPheLauGint«uCluI.y 

2100  ...... 

ACAAGAGACACCATGCACAGAGACGACAGAGGACTTGCTGCACCTCAATTCTCTCTTTGGAAAAGACCAGTAGTCACAGCACATGTTGAGGGCCACCCAGTAGAAGTTTTGCTAGACACA 
ThrArgAapThrMatGinArgAapAapArgGly LBuAlaAlaProGlnPhaSarLaurrpLysArgProVaiValThrAlaHiaValGXuGlyGXnProVaXGluVal LeuLauAapThr 
*GinCluThrProCy«ArgGluThrThrGXuAapLauL«ijHi*I,«uAsnSarI.auPh*GlyLy»AipGXn*** 

2200  ........ 

GGGGCTGACCACrCAArAGrAGCAGGCGTAGACTTAGGGAGCAATTATAGTCCAAAGATAGTAGGGGGAATACGCGCATTCATAAATACCAAAGAATATAAAAArGTAGAAATAACACTA 
GlyAlaAspAapSarllaVaXAlaGlyValGiuLauGlySerAanTy rSarProLy* IlaValGlyGly I XaGlyGly Phal  l*AsnThrLy*GluTy rLyaAanValGluIlaArgVal 

2300  .........  2400 

TTAAArAAAAGAGrAAGAGCCACCATAATGACACGTGATACCCCAATCAACATTTTTGGCAGAAACArTCTCACAGCCTTAGGCATGrCATTAAATCTACCAGTCCCCAAGATACAACCA 

LauAsnLysArgVaXArgAlaThrXleHatThrGXyAspthrProIlaAsnlXaPhaGXyArgAsnXXaLauThrAXaLauGlyMatSarLauAsnLauProVaXAXaLysIlaGXuPro 

2500 

ATAAAAATAATGCTCAACCCAGGAAAGGATGGACCAAAACTGACACAATGGCCCTTAACAAAAGAAAAAATAGAGGCACTAAAAGAGATCTGTGAGAAAATGGAAAGAGAGGGCCAGCTA 
IlaLyal  leMatLauLyaProciy Ly»AapGlyProLyaL«uArgGlnTrpProL*uThrI.yaGluLy* IlaGluAlaLauLyaGluIlaCyaGluLyaMatGluArgGluGlyClnLau 

2600  .... 
CACGACGCACCTCCAACTAATCCTTATAATACCCCCACATTTGCAATCAAGAAAAAGGACAAAAACAAATGGACAATGCTAATAGATTTTACAGAACTAAACAAGCTAACTCAAGACTTC 
GluGXuAlaProProThrAanproTy  rAanThrProthr  PhaAXal  laLy  a  Ly  aI.y»AapI.y*A*nLy*rrpArgMatLauIl#A*pPhaArgCluLauAanLy«ValThrGlnAapPh« 


Figure  30.  Complete  nucleotide  sequence  of  the  HIV-2/ST  provlral  genome.  Shown  are  9,762  bp  of  nucleotide 
sequence  and  the  deduced  amino  acid  sequences  of  the  corresponding  viral  proteins.  The  sequence  stans  at  the 
5‘  cap  site  and  ends  with  the  3‘  polyadenylation  site  of  the  viral  RNA.  The  primer-binding  site  (PBS;  complementary 
to  the  Lys-tRNA),  polypurine  tract  (PPT),  and  short  inverted  repeats  that  flank  the  LTRs  are  underlined.  Core 
enhancer  sequences  (E),  Spl -binding  sites  (Spl),  the  TATA  box  (TATAA),  and  the  polyadenylation  signal  (AATAAA) 
are  shown.  The  U3-R  and  R-U5  boundaries,  as  well  as  the  splice  donor  (SO)  and  splice  acceptor  (SA)  sites,  have 
been  determined  in  analogy  with  HIV-2/ROD.  The  vpr  open  reading  frame  contains  a  premature  in-frame  TAA  stop 
codon  at  position  5777  (***).  Sequence  analysis  was  performed  by  the  chemical  degradation  method  of  Maxam  and 
Gilbert  as  well  as  by  the  dideoxynucleotide-chain  termination  method  of  Sanger  et  al.  The  nucleotide  sequence  of 
JSP4-27  has  been  submitted  to  the  AIDS  Sequence  Data  Base,  Los  Alamos  National  Laboratories,  as  well  as  to  the 
GenBank  and  EMBL  Laboratories. 
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.  .  .  .  .  2700  ...... 

acagaaatccacttaggaattccacacccagcacgactagccaagaagaaacgaattactgtcctagatgtagcggatccttacttttccataccactacatgaggattttacacactat 
ThrGluI  laGlnLauGly  XlaProHiaProAlaGlyLauAlaLy  aLyaLyaArgl  laThrValLauAapValGlyAspAlaTyrPhaSarX  XaProLauHlaCluAapPhaArgGlnTyr 
.  .  .  2800  .  .  ...... 

actgcatttactctaccatcaataaacaatgctgaaccagcaaaaacatacatatataaagtctcaccacaggcatcgaagggatcaccagcaatttttcagtacacaatgacccaggtc 

ThcAlaPhaThrlauProSarll tAa  nAa  nAlaGluP  roGly Ly aArgTyr I laTyr  LyaValSar P  rod  nGlyTrpLysGiySarProAlal laPhaGlnTy rThrMatArgGlnVal 

.  2900  .....  2900  .  .  .  3000 

TTAGAACCATTCAGAAAACCAAACCCGGATATCATTCTCATTCAGTACATGGATGATATCTTGATAGCCAGCGaCAGGACAGATTTAGAACATGACAGAGTGGTTCTGCAGCTAAAGGAA 
LauGluProPhaArgl.yaAlaAsnProAapIlaIlaLauIlaGlnTy rMat AapAap  I  lalauIlaAlaSarAapArgThrAapLauGluHiaAapArgValValLauClnLauLyaGlu 
.  .  •  .  .  .  .  .  3100 

CTTCTAAATGGCCTGGGATTttCCACCCCAGATGAGAAGrTCCAAAAAGACCCTCCATACCAATGGATGGGCTATGAACTCTGGCCAACTAAATGCAACCrGCAAAGAATACAATTCCCC 
LauLauAanGlyLauGlyPhaSarThrProAapGluLyaPhaGlnLyaAapProProTyrGlnTrpMatClyTyrGlulauTrpProThrLyaTrpLyaLauGlnArgl  laGlnLauPro 

3200  .... 

CAAAAGGAAGTATGGACAGTCAATGACATCCAAAAACTGGTGGGTGTCCTAAATTGGGCAGCACAAATCTACCCAGGCATAAAGACCAGAAACTTATCTACGTTAATCAGAGGAAAAATG 
GlnLyaGluValTrpthrValAanAapI laGlnLyaLauValGly ValLauAanTrpAlaAiaGlnl laTy r P  roGly  I  laLyathrArgAa  nLauCy aAr gLauI laArgGlyLyaMat 
.  .  .  .  .  3300  ...... 

ACACTCACAGAAGAGGTACAGTGGACAGAATTAGCAGAAGCGGAACTAGAAGAAAACAAAATCATCTTAACCCAGGAACAAGAAGGATGCTATTACCAAGAGGAAAAGGAGCTAGAAGCA 
ThrLauThrduGluValGlnTrpThrGluLauAlaGluAlaGluLauGluduAanLyal  lal  laLauSarGlnGluClnduGlyCyaTyrtyrGlnCluCluLyaGluLauGluAla 
.  -  .  3400  ........ 

ACAGTCCAAAAAGATCAAGACAATCAGTGGACATATAAGATACACCAGGGAGGAAAAATTCTAAAACTAGGAAAATATGCAAACCTAAAAAATACCCACACCAACCCAGTCAGACTCCTA 
ThrValGlnLyaAapGlnAapAanGlnrrpThrTyrLyanaHlaGlnGlyGlyLy*  llaLauLyaValGlylyaTyrAlalyaValLyaAanThrHlaThrAanGlyValArgLauLau 

.  3500  .........  3600 

GCACAAGTAGTTCAAAAAATAGGAAAAGAAGCACTAG?CATTTGGGGACGAATACCAAAATTTCACCTACCAGTAGAAAGAGATACCTGGGAACAGTGG?GGGATAACTACTGGCAAGTG 
AlaGlnValValGlnLyallaGlyLyaGluAlaLauValllaTrpGlyArgllaProLysPhaHlaLauProValGluArgAapThrTrpGluGlnTrprrpAapAanTyrTrpGlnVal 
.  .  .  .  .  .  .  .  .  3700 

ACArGGATCCCAGACTGGGACTTCATATCTACCCCGCCACTGGTCAGATTAGTATTTAACCTGGTGAAAGATCCCATACTAGGCGCAGAAACCTTCTACACAGATCCATCCTGCAATAAG 
ThrTrpIlaProAapTrpAapPhallaSarThrProProLauValArgLauValPhaAanLauValLyaAapProIlaLauGlyAlaGluThrPhaTyrThrAapGlySarCyaAanLya 
.  .  .  .  .  .  .  3800  .... 

CAATCAAGAGAAGGAAAAGCAGGATACATAACAGATAGAGGAAGAGACAAGGTGACGCTATTAGAGCAAACCACCAATCAGCAAGCACAATTAGAACCCTTTGCGATCGCAGTAACAGAC 
GlnSarArgGluGlyLyaAlaGlyTy  r  HarhrAapArgGlyArgAapLysValArgLauLauGluGlnThrThrAsnGlnGlnAlaGluLauGluAlaPhaAlaMatAlaValThrAsp 
.  .  .  .  .  3900  ...... 

TCAGGTCCAAAGGCCAACATTATAGTAGACTCACAATATGTAATCGGAATAGTAGCAGGCCAACCAACAGAGTCAGAGAGTAAAATAGTAAATCAAATCATAGAAGAAATGATAAAAAAG 
SarGlyProLyaAlaAanllallaValAapSarGlntyrValMatGlyllaValAlaGlyGlnProThrGluSarGlusarlyallaValAanGlnllallaGluGluMatllaLyaLya 
.  .  .  4000  .  .  ...... 

GAAGCAATCTATGTTGCATGGGTCCCAGCCCATAAAGGCATAGGAGGAAATCAGGAGGTAGATCACTTAGTAAGTCAGGGCATCAGACAAGTATTATTCCTAGAGAAAATACAACCCCCT 

GluAlallaTyrValAlaTrpValProAlaHiaLytGlyllaGlyGlyAanGlnGluValAapHlaLauValSarGlnClyllaArgGlnValLauPhaLauGluLyallaGluProAla 

.  4100  .  .  ^  .....  4200 

CAGGAGGAACATGAAAAATATCATAGCAATGTAAAACAACTATCCCATAAATTTGGACTGCCCAAATTAGTGGCAAGACAAATAGTAAACACATGCACCCAATCTCAGCAGAAAGGGGAG 
GlnGluGluHiaGluLyaTyrHiaSarAanValLyaGluLauSarHiaLyaPhaGlyLauProLyaLauValAlaArgGlnllaValAanThrCyaThrClnCyaClnGlnLyaGlyGlu 
.  .  .  .  .  .  .  .  .  4300 

gctatacatgggcaagtaaatgcagaattaggcacttggcaaatggactgcacacacttagaaggaaaaatcattatagtagcagtacatgttgcaagtggatttatagaagcagaagtc 
Alai  laHlaGlyGlnValAanAlaGluLauGlyThrTrpGlnMatAapCy  aThrHlaLauGluGly  LyallallallaValAlaValHlaValAlaSarGlyPhallaGluAlaGluVal 

4400  .... 

ATCCCACAGGAATCAGGAAGGCAAACGGCACTCTTCCTACTAAAACTGGCCAGTAGGTGGCCAATAACACATTTGCACACAGACAATGGTGCCAACTrCACrTCACAGGAAGTAAACArG 
I  laProGlnGluSarGlyArgGlnThrAlaLauPhaLauLauLy  a  La  uAl  a  SarArgTrpP  roll  aThrHisI.au  Hi  SThrAapAanGlyAlaAanPhaThrSarGlnGluValI.yaMat 
.  .  .  .  .  4500  . 

GTGGCATGGTGGATAGGTATAGAAC  AATCCTTCGGAGTACCTTACAATCCACAAAGCCAAGGAGTAGTGGAAGCAATGAATCACCACCTAAAAAATCAGATAACCAGAATTAGAGAGCAG 
ValAlaTrpTrpIlaGlyllaGluGlnSarPhaGlyValProryrAanProGlnSarGlnGlyValValGluAlaMacAsnHiaHisLauLyaAanClnllaSarArgllaArgGluGln 
.  .  .  4600  ........ 

GCAAACACAGTAGAAACAATAGTACTAATGGCAGTTCATTGCATGAATTTTAAAAGGAGGGGAGGAATAGCGGATATGACCCCAGCAGAAAGACTAATCAATATGGTCACTGCAGAACAG 
AlaAanThrValGluThr  I  la ValLauMatAlaVal  HisCysMatAanPh  a  LyaArgAr gGlyGlyIlaClyAapMatThrProAlaGluArgl.au  I  laAanMatValThrAlaGluGln 

.  4700  .........  4800 

CAAATACAATTCCTCCAAGCAAAAAATTCAAAATTACAAAATTTTCGGGTCTATTTCAGAGAAGGCAGAGATCAGCTGTGGAAAGGACCTGGGCAACTACTGTGGAAGGGGGACGCAGCA 
GluIlaGlnPhalauGlnAlaLysAsnSa  rLy  sLeuGlnAanPhaArgValTy  rPhaArgGluGlyArgAapGlnLauTrpLyaGlyProGlyGiuLauLauTrpLy  aGlyAapGlyAla 

4900 

GTCATAGTCAAGCTAGGGGCTGACATAAAAATAATACCAAGAAGGAAAGCTAACATCATCAAAGACTATGGAGCAAGGCAACACATCCATACCGGTTCCAACTTCGACGGTCCCACGGAC 
ValllaValLyaValGlyAlaAapIlaLyal  lallaProArgArgLyaAlalyallallaLy  aAapTy  rGiyGlyArgclnGluMatAapSa  rGlySarAanLauCluClyAlaArgGlu 

vl£  >  MatGluGluGlyLyaArgTrpI laAlaValProthrTrpArgValProClyAr 
5000  .... 

GATGGAGAGGTGGCATAGCCTTATCAAGTATCTAAAATACAGAACAGGAGATCTAGAGAAGGTGTCCTATGTTCCCCACCATAAGGTGGGATGGGCGTGGTCGACTTGCAGCAGGCTAAT 
AapGlyGluValAla*  *  * 

gMatGluArgTrpHlaSarLauIlaLyaTyrLauLyaTyrArgThrClyAapLauGluLyaValCyaTyrValProHlaHlaLysValGlyTrpAlaTrprrpThrCyaSarArgValll 

5100  ...... 

ATTCCCATTAAAAGGAGAAAGTCATCTGCAGATACAGGCATACTGGAACCTAACACCAGAAAAAGGATGCCTCTCCTCCTATTCAGTAACACTAACTTCGTATACAGAAAAATTCTCGAC 
•PhaProLauLysGlyCluSarHisLauGluI  laGlnAlaTyrTrpAanLauThrPr  oGluLyaGlyTrpLauSarSarTy  rSarValArgLauThrTrplyrThrGluLyaPhaTrpTh 

AGATGTTACCCCAGACTGTCCGGACTCCCTAATACATAGCACTTATTTCTCTTGCTTTACGGCAGGCGAAGTAAGAAGAGCCATCAGAGGCGAAAAGCTATTATCCTGCTGCAACTACCC 
rAapValThrProAapCyaAiaAapSarLauI  laHlaSarThrTyrPhaSarCy  aPhaThrAlaGlyGluValArgArgAlal  iaArgGlyGluLysLauLauSarCy sCy sAsnTyrPr 

5300  ....  .....  5400 

CCAACCCCATAACTACCACGTACCGTCACTCCAGTTTCTGGCCTTAGTGGTAGTGCAACAAAATGGCACGCCCCACACAGACAATACCACCAGGAAACAGTGGCGAAGAAACTATCGGAG 
oClnAlaHlaLyaTyrGlnValProSarLauGlnPhaLauAlaLauValValValGl  nGlnAsnGlyArgProGlnArgAspAsnThrThrArgLysGlnTrpArgArgAsnTyrArgAr 

»p*  >  MatAlaGlyProArgGluThrliaProProGlyAanSarGlyGluGluThrllaGlyG 


Figure  30  (cont.) 
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.  .  .  .  .  .  .  .  .  5500 

AGGCCTTCGAGTGGCTAGACAGGACGGTAGAAGCCATAAACAGACAGGCAGTGAACCACCTGCCCCGAGAGCTTATTTTCCAGGTGTGGCAAAGGTCCTGCAGATACTGGCATCATCAAC 
gGlyl»auArgValAlaArgGlnAapClyArgSarHl»LyaClnArgGlySarGluProProAXaProArgAlaTyrPhaProGiyValAlaLyiValLauGluZlaLauAla### 
luAlaPhaGluTrpLauAapArgThrValGiuAlaliaAartArgGluAlaValAsnHiiLauProArgGluLauHaPhaGinValTrpGinArgSarTrpArgTyrTrpHleAapCluC 

5600  ..... 

AAGGAATGTCAATAAGTTACACAAAGTATAGATArrTGTGCCrAArGCACAAAGCrATGTTCATACATTCrAAGAGAGGGrGCACTTGCCrGGGGCCAGCACATCGGCCCCGAGGATGCA 
lnGXyMatSarXlaSarTyrThrLytryrArgTyrLauCytLauMatGlnLyaAlaMatPhaXlBHlaSarLytArgGlyCyeThrCyeLauGlyGlyGlyHltGlyProClyClyTrpA 

5700  .  . 

GATCAGGACCTCCCCCTCCTCCCCCTCCAGGTCTAGTCTAATGACTGAAGCACCAACAGAGTCTCCCCCGGACGATAGGACCCCACCGACGGAGCCAGCGGATCAGTGCCTAATAGAAAC 
rgSarGlyProProProProProProProGlyLauVal* •* 

vpr  >  MacThrGluAXaProThrGXuSarProProGXuAspArgThrProProArgGXuProclyAapGXurrpVallXcGXurh 
.  .  .  5800  ........ 

CCTGAGAGAGATAAAATAAGAAGCTTTAAAGCACrrTGACCCrCGCTTGCrAArrACTCTTGGCAACTATArCTATGCrAGACATGGAGACACCCTTGAACCCCCCACAGGGCTCATTAG 
rLauArgGluX laLy *EadGluAlaLauLy  *Hi*PhaA*pProArgLauLauXlaThrLauGlyA*nTyrXlaTy rAlaArgHiaGlyAapThrLauGluGXyAlaArgGly Laul XaAr 

tat  >  Ma tGluThr P  roLauLy * AlaProGluGly SarLauG 

.  5900  .........  6000 

GATCCTACAACGAGCCCTCCTCTTGCACTTCAGAGCACGATGCGGCCGCTCAAGGATTGGTCAGCCCAGGGGACGAAATCCTTTATCAGCTATACCAACCCCTAGAGGCATGCCATAACA 
gZleLauGlnArgAlaLauLauLauHiaPhaArgAlaGlyCy$GlyArgSarArgIlaGXyGlnProArgGlyArgAanProLauSarAXaZiaProThrProArgGXyMatArg*#» 
lySarTyrAanGluProSarSarCysThrSarGluGlnAapAlaAlaAlaGlnGlyLaaValSarProGXyAapGluIlaLauTyrGlnLauTy rGlnProLauGluAlaCy aAapAanL 
,  .  .  .  .  .  .  .  .  6100 
AATGTTACTGTAAAAAGTGCTGCTACCATTGCCAGATGTGTTTTTTAAACAAGGGGCTCGGGATATGGTATGAACGAAAGGGCACAAGAAGAAGAACTCCGAAGAAAACTAAGGCTCAIT 
y«Cy*TyrCy*Z.yaLyiCy*CysTyrHXsCy*GlnMatCys?hal.auAsnLyaGlyLauGlyZlaTrpTyrGluArgLyaGlyArgArgAEgArgThrProLyaLyaThrZ.y*AXaHiaS 

rtf  >  MatAanGluArgAlaGluGluGluGXuLa-jArgArgX.yeLauArgLauZla 
SO  .  .  .  .  .  .  6200  .... 

eCTCTTCTGCATCACACAAGTSAGTAAGATGTCTCGTACGAATCAACTATTTGTTCCCAGCTTGCTAGCTAGTGCTTGCTTAATATATTCCGTCCAATATGTGACTCTTTTCTATGGCGT 

arSarSarAXaSarAspLys 

ArgLauLauHitGXnThrAan 

ana  >  MatCy sGlyArgAsnGlnLauPhaValAlaSarLauLauAlaSarAlaCyaLauZlaTy rCyaValGlnTyrValThrValPhaTyrGlyVa 

6300  ...... 

GCCCGTGTGGAGAAATGCATCCATtCCCCTCTTTTGTGCAACTAAAAATAGAGATACTTGGGGAACCATACAGTGCTTGCCAGACAATGATGACTATCAGGAAATAGCTTTAAATGTCAC 
XProVaXTrpArgAsnAXaSarZXaProLauPhaCysAXaThrLysAsnArgAspThrTrpGlyThrZlaGXnCysLauProAapAanAspAapTyrGXnGXuZXaAXaLauAtnVaXTh 
.  .  .  6400  ........ 

AGAGGCCTTCGACGCATGGAATAATACAGTAACAGAACAAGCAGTAGAAGATGTCTGGAGTCTATTTGAGACATCAATAAAACCATCCGTCAAACTAACACCCTTATCTGTAGCAATGCG 

cGluAlaPhaA*pAlaTrpA*nA*nThrValThrGluGXnAiaVaiGluA«pVairrpsarI,BuPhaGiurhrSarZlaI,y*ProCy*ValLy*tauThrProLauCyaVaiAlaMatAr 

6500  .........  6600 

TTGTAACAGCACAACrGCAAAAAACACAACCrCCACACCAACAACCACCACAACAGCAAACACAACAATACGAGAGAATTCTTCATGCATACCCACAGACAACTGCACACGGTTCGGACA 

gCyaAanSarThrThrAiaLysAsnThrThrSarThrProThrThrThrThrThrAlaAtnThrThrXlaClyGluAansarSarCyaXlaArgThrAspAanCyaThrGlyLauGlyGl 

6700 

AGAAGAGAtCGTCGACTGTCAGTTCAATATGACAGGATTAGAGAGGGATAAGAAAAAACTATATAATGAAACATGGTACTCAAAAGATGTAGtCTGTGAATCAAATGACACCAAGAAAGA 
uGluGluMatValAapCysGInPhaA*nM#tThrGlyLauGluArgA»pLy#LyaLy*LauTyrA*nGlurhrTrptyrSarLyaA»pValValCy  aGiuSarAanAapThrLyaLyaGi 

6800  .... 
GAAAACATGTTACATGAACCACTGCAACACATCAGTCATCACAGAGTCATGTGACAAGCACTATTGGGATACTATCAGGTTTAGATATTGTGCACCACCGGGTTTTGCCCTGCrAAGArG 
viLyaThrCy*TyrMatA*nHl*Cy*A*nThrSarVaiIlaThrGluSarCy*AapLy*Hi*TyrTrpA#pThrMatArgPhaArgTyrCyaAXaProProGlyPhaAlaLauLauArgCy 
.  .  .  .  .  6900  ...  ... 

caatgataccaattattcaggctttgaccccaattcttctaaggtagtacctgctacatgtacaagcatgatggaaacgcaaacctccacttgctttgcctttaatggcaccagcgcaca 

«A*nA*pThrAanTyrSarGiyPhaGluProA*nCy«SarLy*ValVaiAlaAlaThrCy*ThrArgMatMatGiuThrGlnThrSarThrTrpPhaGlyPhaA*nGXyThrArgAlaGl 
.  .  .  7000  ........ 

AAATAGAACATATATCTATTGGCATGGTAGGGATAATACAACCATCATTAGCTTAAACAAGTTTTATAATCTCACCGTACATTGTAAGAGGCCAGGAAACAAGACAGTTGTACCAATAAC 
uAanArgThrTy  rl  XaTyrTrpHisGiyArgAapAsnArgThr  Z  XaZiaSarLauA»nLy«PhaTyrAanLauThrValHX»Cy*Ly*ArgProClyA»nLy»ThcVaiValProZlaTh 

.  7100  .  ........  7200 

ACTCATGTCAGGGTTAGTGTTTCACTCCCAGCCAAtCAATAGAAGACCCAGGCAAGCATGGTCCTGGTTCAAAGGCGAGTGGAAGGAAGCCATGAAGCAC^TGAAGCTAACCCTTCCAAA 
rLauMacSarGlytauVal PhaHi  iSarGl nProI laAanArgArgProArgGInAlaTrpCyaTrpPhaLy tGlyGluTrpLy iGluAlaMatLy «GluValLytLauThrLauAlaLy 

7300 

ACATCCCAGGTATAAAGGAACCAACGACACAGAAAAAATTCGTTTTATAGCGCTAGGAGAACGCTCAGACCCAGAAGTGGCATACATGTGCACTAACTGCAGAGCAGAATTTCTCTACTG 
«HX*ProArgTyrLysGlyThrA*nA*pThrGluLysllaArgPhaXlaAlaLauGlyGluArgSarA*pProGiuValAlaTy rMacTrpTh  :  A inCyaArgGlyGluPhaLauTyrCy 

7400  .... 

CAATATGACTTGGTTCCTCAATTGGGTAGAAAACACAACGAATCAGACACAGCACAATTATGTGCCAtGCCATAtAAAGCAAArAArTAAIACCrGGCACAAGGTAGGGAAAAATGTArA 
sAsnMacThrTrpPhaLauAtnTrpVaXGXuAsnArgthrAsnGXnThrGXnKlsAtnTyrVaXP  roCysHlsZXaLytGXnZlaZXa/vtnThrTrpHisLy  tValGXyLysAf  nVaXTy 

7500  ...... 

TTTGCCTCCTAGGCAACGACAGTTAACCTGCAACTCTACAGTGACCAGCATAATTGCTAACATTGACGGAGGAGAGAACCAGACAAATATTACCTTTAGTGCAGACGTGGCAGAACTATA 
rLauProProArgGluGlyGlnLauThrCysAanSarThrValThrSar  XlaXlaAlaAsnllaAapGlyGlyGluAsnGlnThrAsnllaThrPhaSarAlaGluValAlaGluLauTy 

7600  ........ 

CCGATTAGAATTGGGCGATTATAAATTGATAGAAGTAACACCAATTGGCTTTGCACCTACACCAGTAAAAAGATACTCCTCTGCTCCACTGAGGAATAAAAGAGGTGTATTCGTCCTAGG 

rArgLauGluLauGlyAapTyrLytLauIlaGluValThrPrqZlaGlyPhaAlaProThrProVaiLy*ArgTyrSerSarAlaProValArgAar\Ly»ArgGlyvalPhaVaiLauGl 

7700  .........  7800 

GTtCTTAGGTTTTCTCACGACACCAGGAGCTGCAATGGGCGCCGCGTCCTTGACGCTGTCGGCrCAGTCTCGGACTrTATTGGCCGGGATACTGCAGCAACACCAACAGCTGTTGGACCt 
y  PhaLauGlyPha  LauThrThrAlaGlyAl  a  AlaMacGlyAl  aAlaSa  rLauThrLau  SarAlaGlnSarAr  gThrLauLauAlaG  ly  I  laValGlnGlnGlnClnGlnLauLauAapVa 

......  7900 

ggtcaagagacaacaagaaatgttgcgactgaccgtctggggaacaaaaaatctccaggcaagagtcactgctatcgagaaatacttaaaggaccaggcgcaactaaattcatgccgatg 

lValLyiArgGlnGlnGluMatLauArgLauThrValTrpGlyThrLysAanLauGlnAlaArgValThrAlallaGluLycTyrLauLyeAapGlnAlaClnLauAcnSarTrpClyCy 
.  .  .  .  .  .  .  8000  .... 
tgcgtctacacaagtctgccacactactgtaccatgggtaaatgacaccttaacgcctgattggaacaacatgacatggcagcaatgggaccaacgaatccgcaacctagaggcaaatat 

*AlaSarArgGinVaXCytHi*ThrThrVaXProTrpValAtnAipThrLaurhrProA«prrpA*nA*nMacThrTrpclnGluTrpGluGlnArgllaArgA»nLauGluAiaAanZl 
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•100  ...... 

CAGTGAAAGTTTACAACAGGCACAAATCCAGCAAGAAAAGAACATGTATGAACTACAAAAATTAAATAGCTCGCATGTTTTTGGCAACTGGrTTCATTTAACCTCCrCGATCAAATArAr 
•  SarGluSarLauGluGlnAlaGl  nl laClnGlnGluly *AanMatTyrGluLauClnLy*LauAanSarTrpA*pValPh«GlyA»nTrpPhaAspL«uThrSarTrpIlaLy»Tyr  1 1 
.  .  .  3^00  ........ 

TCAGtATGGAGTTTATATAGTAGTAGGAATAATAGTTTTAAGAATACTAATATATGTAGtACAAATGTTAAGTACACTTAGAAAGCGCTATAGCCCTGTTTTCTCTTCCCCCCCCGCTTA 

aGlnTyrClyValTyrIleValValGlyXlaXlaVall.av>ArgIlaVAlIlaTyrValValGlnHatLauSarArgLauArgLyaGlyTyrArgProVAlPhaSarSarProProAlaTy 

SA  8300  .........  8400 

CTTCCAACACATCCATATCCACAAGGACCGGGAACAGCCACCCAGACAACAAACAGAACAAGACCTTCGAAACAGCCTTGGAGACAATTGGTGGCCCTCGCCGATAAGATATATACATTT 
rPh«GlnGlnXlaHLstlaHla£.y8AspArgGluGlnProAiaArgGluGluThrGluGluAapValGlyAan5arVaXGlyAapAARTrpTrpProTrpProllaAr9TyrXlaM.laPh 
tat  >  SarZl«s«tThrAr9ThrGlyA«RSarclnProGluLyaLy«GlnLyaLyaThrLauGluThrAlaLauGluThrXlaGl.yGlyProGlyAr9*** 
rar  >  ProryrProGlnGlyProClyThrAlASarGlnArgArgAanArgArgArgArgTrpLyaGlnArgTrpArgGlnLauValAlaLauAiaAapLyaIl.aTyrThrPh8 

0300 

CCTGATCCGCCAGCTGATtCGCCTCTTGAACAGACTATACAACATCTGCAGGGACTTACTATCCAGGAGCTTCCAGACCCtCCAACTAATCTCCCAGAGTCTTCGGACACCATTGACAGC 
aLauIlaArgGlnLauIleArgLauLauAanArgLauTyrAanllaCyaArgAapX.auLauSarArgSarPhaClnThrl.AuClnLauIlaSarGlnSarLauArgArgAiAX.aurhrAl 
ProAspProProAiaAspSarProX.auGluClnThrXlaGlnHlaL«uGlnGlyLauThrXlaGlnGluLauProAspProProThrA«nLauProGluSarSarGluSarXlaAapSar 
.  .  .  .  .  .  .  8600  .... 
AGTCAGAGACTGGCTGAGATTTAACACAGCCTACCTGCAATATGGGGGCGAGTGGATCCAACAAGCCTTCCGACCCTTCGCGACGGCTACGCGAGAGACTCTTACAAACCCCTGCACACG 
aValArgAapTrpLauArgPhaAanThrAlaTyrLauGlnTyrGlyClyCluTrpllaGlnCluAlaPhaArgAlaPhaAlAArgAlaThrClyGluThrlauThrAanAlATrpArgCl 
SerGlnArgLauAlaClutle*** 

naf  >  MatGlyAlaSarGlySarLyaLyaArgSarGluProSArArgGlyLauArqCluArqLauLauGlnThrProClyCluA 
8700  ...... 

CTTCTGGGGGACACTGGGACAAATTGGGAGGGGAATACTTGCAGTCCCAAGAAGCATCAGGCAGCGGGCAGAAATCGCCCTCCTGTGAGGGACGCCCGTATCAACAGGGACAtTTTATGA 

yPhaTrpGlythrLauGlyClnIlaGlyArgClyIlaI.«uAlaValProAr9Argll«ArgClnClyAlaGluIlaAl*LauLau*** 

lASarGiyGlyHlaTrpAapLyaLauGlyGlyCluTyrLauGXnSarGlnCluGlySarGlyArgGlyGlnLyaSarProSatCytGluGlyArgArgTyrGlnClnGlyAapPhaMatA 

8800  ........ 

ataccccatggagagccccaccagaagggcacaaaggctcgtacaagcaacaaaatatccatgatgtagattcagatgatcatgacctactagccgtccctctcacaccaagagtaccat 

anThrProTrpAr9AlaProAlaGluGlyGlul.yaGlySarTyrLyaGlnGlnAanMatAapAapValAapSarAapAapAapAapLauValGlyValProValThrProAr9ValProL 

.  8900  .  .  .  PPT  .  |—  >03.  ....  9000 


«uArgGluMacThrTyrAr9Z.auAlaAxgAap*acSarHiaI,auIlaLyaCluLyaGlyGly2.auGluClyLauTyrTyrSarAapAr9ArgArgArgValX.auAapIl«TyrLauGliiI. 
.  .  .  .  .  .  .  .  .  9100 

ACGAAGACCCAATAATTGGAGACTGGCACAACTATACTCATGGACCAGCAGTAAGCTATCCAAAGTTCTTTGCGTCGTTATCGAAGCTAGTACCAGTAGATGTCCCACAAGAGGCACATG 
yaGluCluCly  IlaXlaGlyAapTrpGlnAanTyrThrHlaGlyProGlyValArgTy  rProLyaPhaPhaGiyTrpLauTrpLyaLauVaiProValAapValProGlnGluClyAapA 
.  .  .  .  .  .  .  9200  .... 

ACAGTGAGACTCACTCCTTAGrGCATCCAGCACAAACAAGCACGTTTGATGACCCGCATGGAGAAACATTAGTTTGGAGGTTTGACCCCACGCTACCTTTTAGCTACGAGGCCTTTATTC 
apSarGiuThrHiaCy«LaviValHiaProAlaGlnThrSarArgPhaAapA*pProKiaGlyGluThrLauValTrpArgPhaAapProThxl,auAlaPhasarTyrCluAlaPhaXlaA 
.  .  .  .  .  9300  ....  .  . 

CATACCCACAGGAGTTTGGGTACAAGTCAGGCCTGCCAGAGGATGAATGGAACGCAAGACTGAAAGCAACAGGGATACCGTTTAGCTAAAAACACCAACAGCTATACTTCCTCACGGCAC 


rgTyrProGluCiuPhaGlyTyrLyaSarGlyLauProcluAapCluTrpLyaAlaArgLauLy*AlaArgGlyllaProPhasar#** 

.  .  *  9400  B  Spl  SpI  Spl  ... 

CAAeTAACTAACACAAAACAGCTCAGACTCCAgGGACrrrCCAGAACCGCCTGTTACCACGCGACGGACATCGGACGACCCGGTGGgSAACGCCCTCATACTTTCTGTATAAATCTACCC 

03<— >*  .........  9600 

CCTACTCCCATTGTATTCAGTCGCTCTGCCGAGACGCTCGCAGATTGACCCCTGGCACGTTCTCTCCAGCACTAGCAGGTAGACCCTCGGTGTTCCCTGCTAGACTCTCACCAGTGCTTC 


CCCGGCACTGGGCAGACGGCTCCACGCTTGCTTCCTTAAAACACCTCTTAATAAAGCTGCCAGTTAGAACCA 


Figure  30  (cont.) 


99 


Envelop*  Amino  Add  S*qu*nc*  Ofevgone* 


HIV-2/ST 

HiV-2/ROD 

HIV-2/ISY 

HIV-BNIHj 

H1V-2/GH 

SIV/MACjjj 

SIV/SM 

HIV-2/ST 

19% 

17% 

18% 

16% 

28% 

28% 

HIV-2/ROD 

11% 

20% 

19% 

18% 

28% 

28% 

HIV-2/ISY 

10% 

11% 

20% 

19% 

30% 

29% 

HIV-2/NIHj 

12% 

12% 

13% 

19% 

28% 

28% 

HIV-2/GH 

11% 

12% 

12% 

15% 

29% 

29% 

siv/mac142 

23% 

23% 

24% 

26% 

24% 

19% 

SIV/SM 

23% 

22% 

23% 

23% 

23% 

15% 

Toert  Nuettotd*  SoQuano*  Otovgano* 


Figure  31.  Nucleotide  and  amino  acid  sequence  divergence  among  HjV-2  and 
SIV  strains.  The  percent  nucleotide  sequence  divergence  between  HIV-2/ST  (JSP4- 
27),  HIV-2/ROD,  HIV-2/ISY.  HIV-2/GH,  SIVMAC142,  and  SIVSM  is  shown  along  with  the 
percent  amino  acid  sequence  divergence  of  their  envelope  glycoproteins.  Sequences 
were  aligned  pairwise,  using  the  Microgenie  computer  software  (Beckman). 
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%  Homology 

Open  reading  frame  ST/ROD  ST/1SY  ROD/1SY 


Nucleotide 

Amino  acid 

Nucleotide 

Amino  acid 

Nucleotide 

Amino  acid 

LTR 

91.5 

91.4 

91.8 

gag 

91.1 

92.0 

90.7 

89.1 

90.3 

89.5 

pol 

91.2 

91.4 

91.2 

91.4 

91.2 

91.7 

V./ 

91.5 

88.4 

92.4 

91.2 

92.2 

85.6 

vpx 

89.3 

85.7 

92.6 

90.2 

88.4 

87.5 

vpr 

89.9“ 

80.0“ 

89.8“ 

84.8“ 

93.3 

89.5 

tat 

86.7 

75.4 

88.7 

78.5 

90.5 

80.8 

rev 

82.0 

85.0 

ND* 

ND* 

ND* 

ND* 

env 

85.5 

81.4 

86.0 

83.0 

84.8 

80.4 

nef 

86.1 

78.9 

85.8 

77.7 

87.0 

78.5 

Overall  %  homology  89.5  89.9  89.3 


Figure  32.  Sequence  homologies  among  virus-specific  genes  of  three  HIV-2 
proviruses. 

•An  in-frame  stop  codon  is  present  in  the  HIV/S/ST  vpr  open  reading  frame. 
bMeaningful  comparison  of  the  HIV-2/ISY  rev  gene  with  the  corresponding  rev  genes 
of  HIV-2/ST  (JSP4-27)  and  HIV-2/ROD  was  not  possible  because  of  considerable 
length  differences  between  their  sequences.  ND,  Not  done. 
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•  •  *  *•  •  •  •  •  •  128 
HC6  RMOI.rVASILASACLITCV0TVTVrYGVPVWRl!AgI  PLrCATKNROTMGTIQCLPDNDOYQEIALgVXEArDAWiqflVTEQAVtDVmLrETSIKFCVELTPLCVAMRCMT.  . .  TAKHTTS-T 
-H.  .  . LI-I - V— ' T - T-K— T - - 1 - 1 - H - K-S—ESS-CM - KS 

-GCL  C LI-I— L-VYC T - A - T - T - G — S-L - * - E - 1 - Q - S - IT - KSETDRMCL-R-S- 

-GCLG LX  -L-— V-VLE-C - A-K — T - T - S-L-I - 0 - 1 - N - 1 - K-ETD  RMCL-GNAG 


HIV— 2/ST 

HIV-2/R0D 

KXV-2/ISY 

HIV-2/HXH- 

HI V-2/CH 

SIV/MAC 

SIV/SH 


•  ••  ••••  •  •  2SC 

PTTTTTAHTT . ICEHSS .  Cl  RTDHCTGLGEEEMVDCQf  JggrGLERDKRKLYgRTWYSKPWCES  .HBIKKEKTCYMNHCHTSVITESCDKHTWDTMRrRYCAPPCrALLRClfPTirragrEP 

TS - TP-DQEQE - -S-DT .  ?-A-A - S - TIN - Q - TN-S- .  NQTQ - AI - Y - A- 

VA — SPSCP . DMINOTDP--QLN — S — R — 0 — E - - - L - Q-S - D-S-  .  DR-R - - - A - V - 
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T. - AIT — ATPSVAENVINE-MP — KNNS-A — EQ-P-IG-K - H - C - R-LI — QS. ANES-SR H - Q - AI - Y - S — L — A- 


HIV-2/ST 
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HIV-2/1 JY 
HIV— 2/HI H- 
HIV— 2/GH 
SIV/MAC 
SIV/SH 
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.  TQHWYVPCHI  KOI  IHTHHKVGKHVYLPPREGQLTCHrrVTSI  1  AHia  .  CCEMOTWITPSAEVAELr  RLELCDYKI.  HIV-2/ST 
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- — - 1 - A - -L—  L-V-C— — - - A— 0— - — — V - P - GQ-P-TK-CE-C-G-0 


HIV-2/ST 

HIV— 2/ROD 

RIV-2/ISY 

HIV-2/HIH, 

HIV-2/CH 

SIV/MAC 

SIV/SM 


•  893 

SVGDHWMPMPI AYIHFLI RQLI  RLLNRLYNICRDLLSRSrQTLQLISQSLRRALTAVRDMLRFNTAYLQYCGENIQEAFRAF  AR.  ATCETLTHAMRGFMCTLGQIGRGI LAVPRRIRQCAEIALL  HIV-2/ST 

MG — RY - A - T S - L - Y-N- . . LR — P - C - Q-A — . — R— AG-C— L-RV-IR -  HIV- 2/ROD 

D — SRS - 1 - —  L - T - S - LYLI  — PL . . LRA - C - Q-L—  .V-R - S-G-SL— A— R -  HXV-2/ZSY 

MG — RS - A - L - TC - - - - — ISPI— P-F— Q —I - LRA- - C - Q-L — .  T-R— AG-G-DL-RA-QR - L HIV-2/HXH* 

DD-YDL# — — H - Hl-T— TG—  R - TN3P-HA — 1 - LRA - - - Q - R.  T-R— AS— G-LCAAVQRV - - -  KXV-2/GH 

GG-NSS - Q-E - - - TH-FSN — T - AY-I  —  P-L-R-SAT-RR — IV — TILT - VSYFH — VQ-6H-S — . - AG - Dl-E— RAC— t* - X - L-LT—  SIV/MAC 

RG-SRS— Q-E - - - TM-FSS - M-L— Y-X—  PVL - STT-QR— CVX-IEX - HRYF - VQ-WVKL-.  A - AS— GOI-E— RV - 1 - L-LT—  SIV/SM 


Figure  33:  Alignment  of  the  deduced  amino  acid  sequences  of  the  envelope  open  reading  frames  of 
HIV-2/ST  (JSP4-27),  HIV-2/R0D,  HIV-2/ISY,  HIV-2/NIHz,  HIV-2/GH,  SIVmac  and  SIVsm.  The  position 
of  the  presumed  primary  envelope  precursor  cleavage  site,  as  identified  for  HIV-2/ROD,  is  shown. 
Symbols:  •,  cysteine  residues  conserved  among  all  seven  envelope  sequences;  *,  nonconserved 
cysteine  residues;  #,  in-frame  stop  codons  present  in  the  sequences  of  SIVmac  and  HIV-2/GH;  • , 
gaps  introduced  for  optimal  sequence  alignment.  Potential  N-linked  glycosylation  sites  (NXS/T)  in  the 
HIV-2/ST  envelope  sequence  are  underlined. 
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Figure  34.  Diagram  of  the  generation  of  fusogenic  and  cytopathic  variants  of  H1V-2/ST  showing  the  origins  of 
biologically  distinct  from  the  original  HIV-2/ST  isolate  by  limiting  dilution  cloning.  All  cell  line,  and  the  parental  bulk 
culture  produce  nonfusogenic  and  noncytopathic  progeny  virus  (E3 ).  Two  fusogenic  and  cytopathic  variants 
ST/24.1  C  and  ST/24.2C  were  generated  by  repeated  cell  free  passage  of  ST/24  supernatants  to  uninfected  SupTl 
cells  (H ).  The  size  of  the  envelope  transmembrane  glycoprotein  (TM)  for  virions  derived  from  each  culture  is  shown. 
kD,  Kilodaltons. 
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JSP4-27 


M3 

Clooa* 


JSP4-27 


BmBZ 

cc 

agaattgggggattataaattgatacaagtaacaccaattggctttgcacctacaccagta 

Primer  1  LysLeuIieGiuVaiThrProIleGiyPheAleProThrProV*! 


Ser 

- - C - T - 

Ser 

SerXl* 

Ser 

- -C - T - 

Ser 

^  olMT&qa  »ita 

AAAAGATACTCCTCTGCTCCAGTGAGGAATAAAAGAGGTGTATTCGTGCTAGGGTTCTTA 

LysArqTyrSerSerAiaProVaiArgAsnLysArgGlyVaiPheVaiLeuGiyPheLeu 


— G-— — — G - 

- G - G - 


ST/812 

10/10 

ST/ 2  4 

10/10 

ST/2 4. 1C 

1/9 

ST/24. 1C 

8/9 

ST/24. 2C 

5/10 

ST/24. 2C 

5/10 

ST/B12 

10/10 

ST/24 

10/10 

ST/24. 1C 

1/9 

ST/24. 1C 

8/9 

ST/24. 2C 

5/10 

ST/24. 2C 

5/10 

JSP4  -27  ggttttctcacgacagcaggaqctgcaatgggcgcggcgtccttgacgctctccgctcag 

GlyPheLeuThrThrAlaGlyAl*AiaMecGlyAlaAlaSerLeuThrLeuSerAlaGln 

- -  ST/B12  10/IQ 

-  ST/24  10/10 

-  ST/24. 1C  1/9 

-  ST/24. 1C  8/9 

-  ST/24. 2C  5/10 


JSP4-27 


JSP4-27 


TCTCGGACTTTATTGGCCGGGATAGTGCAGCAACAGCAACAGCTGTTGGACGTGGTCAAG 

SerArgThrLeuLeuAlaGlylleValGlnGlnGlnGlnGlnLeuLeuAspValValLye 


ST/B12 

10/10 

Smr 

AGACAA...//.. 

ArqGln 

mx 

G  G 

. AACATGTATGAACTACAAAAATTAAATAGC 

Primar  2 

Oltt 

ST/B12 

ST/24 

ST/24. 2C 
ST/24.2C 

10/10 

10/10 

5/10 

5/10 

Figure  35.  Nucleotide  sequence  variation  in  the  fusion  domain  of  cytopathic 
and  noncytopathic  HIV-2/ST  strains.  A  stretch  of  230  bp  of  PCR-amplified  envelope 
sequence  is  compared  between  JSP4-27  and  four  other  HIV-2/ST  strains.  The 
boundaries  of  the  amplified  fragments  are  indicated.  The  sequences  of  both  primers 
used  to  amplify  the  genomic  DNA  of  ST/BI  2,  ST/24,  ST/24.1  C  and  ST/24.2C  are 
underlined,  and  the  base  pair  changes  that  were  introduced  to  generate  BamH\  and 
Pst\  cloning  sites  are  indicated.  Nucleotide  substitutions  are  shown  with  respect  to 
the  JSP4-27  sequence,  and  amino  acid  sequence  changes  unique  to  the  cytopathic 
and  fusogenic  ST/24  strains  are  in  boldface.  Asterisks  mark  the  threonine  and  alanine 
substitutions  previously  identified  to  distinguish  JSP4-27  from  most  other  cytopathic 
HIV-2  strains.  The  number  of  Ml  3  clones  analyzed  per  HIV-2/ST  strain  is  listed,  with 
frequencies  referring  to  the  proportion  of  clones  that  have  identical  sequences. 
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4172 

4129 

4232 

4139 

4292 

4249 

4352 

4309 

4412 

4369 

4472 

4429 

4532 

4489 

4592 

4549 

4652 

4609 

4712 

4669 


TCATGTGCCCAATGTCAACAGAAAGGGGAAGCTATACATGGGCAAGTAAATGCAGAACTA 
MINI  Mill  II  II  Mill  ;  I  I  I  M  M  III  1  II  I  I  I  llllll  Mil  III 
ACATGTGATAAATGTCATCAAAAAGGAGAAGCTATACATGGGCAGGTAAATTCAGACCTA 


pol  A 


ggcacttggcaaatggactgcacacatttagaaggaaagatcattatagtagcrgtacat 

II  I  I  1 1 1 1 1 1 1 1 1 1 1 1  II  ll  III  1 1 1 1  1 1 1 1 1  II  I  1 1 1  1 1  I  M:M  I  M:i 
GGGACTTGGCAAATGGATTGTACCCATCTAGAGGGAAAAATAGTCATAGTTGC^I^ii 

EcoRI 


A  C . % 

ff'g’PGgAAGafeGR'PyfKTAGAAGCAGAAGTCATCCCACAGGAATCAGGAAGACAAACAGCA 

i  :toi  1 1 1 1  m  1 1  ii  1 1 1 1 1  iii  1 1 1 1 1 1 1 1 1 1  iiiiii 

gtagctagtggattCatagaagcagaagtaattccacaagaaacaggaagacagacagca 

pol  C 


ctcttcctattgaaactggcaagtaggtggccaataacacacttgcatacagataatggt 
II  II  II  II  111  1 1 1 1 1 1 1  II  1 1 1 1 1  II  1 1 1 1 1  1 1 1 1  1 1 1 1 1 1 1 1 1 1 1 1 
ctatttctgttaaaattggcaagcagatggcctattacacatctgcacacagataatggt 


gccaacttcacttcacaggaggtgaagatggtagcatggtggataggtatagaacaatcc 
II  1 1 1 1 1  1 1 1 1  II  II  II  1 1 1 1 1 1 1 1  1 1 1 1 1 1 1 1 1  III  1 1 1 1 1  11  II 
GCTAACTTTGCTTCGCAAGAAGTAAAGATGGTTGCATGGTGGGCAGGGATAGAGCACACC 


TTTGGAGTACCTTACAATCCACAGAGCCAAGGAGTAGTAGAAGCAATGAATCACCATCTA 

1 1 1 1 1  1 1 1 1 1  1 1 1 1 1 1 1 1 1 1 1 1 1 1  II  1 1 1 1 1 1 1 1  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1  II 

TTTGGGGTACCATACAATCCACAGAGTCAGGGAGTAGTGGAAGCAATGAATCACCACCTG 


AAAAACCAAATAAGTACAATCAGAGAACAGGCAAATACAATAGAAACAATAGTACTAATG' 

1 1 1 1 1  llllll  II  llllll  1 1 1 1 1  llllll  II  1 1 1 1 1 1 1  llllll  1 1 1 1 1.' 
AAAAAT  CAAATAGATAGAA  T  CAGGG  AACAAGCAAAT  TCAGTAGAAAC  CATAGTAT  TAAlC! 

ECORI 

G 

GGJ^TTGATiCGCA'l 

m  ti  ft  Ewe  i  fey  miM  mm  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1  1 1 1 1 1 

GCAGTTCATTGCATCAATTTITAAAAGAAGGGGAGGAAT  AGGGGATATGAC  TCCAGCAGAA 

pol  0 

AGATTAATCAATATGATCACCACAGAACAAGAGATACAATTCCTCCAAGCCAAAAATTCA 

MINIM  II  II II II II  II  II  I  II  I  I  I  I  I  II  II  II  I  I  III  I  I  I  II  I  III 

AGATTAATTAACATGATCACTACAGAACAAGAAATACAATTTCAACAATCAAAAAACTCA 


B^gTTtTAAAAGAAGGGGGGGAATAGGGGATATGACTCCATCAGAA 


AAATTAAAAGATTTTCGGGTCTATTTCAGAGAAGGCAGAGATCAGTTGTGGAAAGGACCT 

II  I  I  I  III  I  I  1  Mil  I  I  I  I  I  M  I  I  II  I  l  I  I  II  III  I  llllll  I  I  I  II  II  I  II  I  I 
aaatttaaaaattttcgggtctattacagagaaggcagagatcagctgtggaagggaccc 

pol  B 


HIV-2 

SIVmac 

HIV-2 

SIVklAC 

HIV-2 

SIVhac 

HIV-2 

SIVmac 

HIV-2 

SIVmac 

HIV-2 

SIVmac 

HIV-2 

SIVmac 

HIV-2 

SIVmac 

HIV-2 

SIVmac 

HIV-2 

SIVmac 


Figure  38.  Nested  HIV-2/SlV^  pol  primers  with  cloning  enzyme  sites.  Outer 

(pol  A  and  D)  and  inner  (pol  B  and  C)  oligonucleotide  primer  sequences  are  shown 
in  shaded  areas  and  are  derived  from  best  match  sequence  of  HIV-2  and  SIVMAC. 
EcoRI  sites  were  introduced  within  the  inner  primer  sequences  to  facilitate  cloning  of 
the  amplified  product.  The  size  of  the  outer  and  inner  amplification  products  are 
555bp  and  337bp,  respectively. 
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Figure  40.  Percent  amino  acid  sequence  identity  in  a  279bp  PCR  amplified  pol  region  of  primate  lentiviruses. 
The  sequences  of  the  four  West  African  SIVAGM  viruses  are  indicated  by  SIVAGMsab1^.  The  four  SIVAGMsab  viruses  differ 
from  each  other  by  1-7%  but  from  East  African  viruses  by  19-22%  and  from  isolates  of  HIV-1  and  HIV-2  by  29-31%. 
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Figure  41.  Phylogenetic  tree  analysis  for  human  and  simian  immunodeficiency  viruses. 
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del  del 
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genomic  regions  confirmed  by  nucleotide  sequence  analysis 


Flour#  42.  Molecular  provlral  clones  of  HIV-1  derived  from  uncultured  brain  DNA  of  a  paMent  with  AIDS  dementia  Ten 
hiv  i  oravirai  donea  from  a  lambda  phage  genomic  library  of  8  *  10*  recombinant  phage  were  identified  and  mapped.  Because 
form*  in  this  unculturedbrain  ONA  specimen  were  in  unintegrated  circular  form,  a  single  cutter  EcoRI,  was 
donea  weK-length  with  either  one  or  two  LTRs  (YU-2.  10  21,  32L  < one  done 
waTparfa)  but  integrated  into  genomic  DNA  (YU-6),  and  the  remaining  6  dones  were  detective  by  virtue  of  deletions  or 
rearrangements  (YU-1,  3,  4,  5,  27). 
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PREDOMINANT  FORM : 

YU- 3 

PCR-B13 

YU-4 

PCR-B16 

YU-10 

PCR-C6 

YU-21 

YU-32 

PCR-C20 

PERCENTAGE 

:  5/8=62.5% 

4/11=36.3% 

COMPARISON  WITH  THE 

PREDOMINANT  FORM 

(bp/bp) 

YU-1 

1/525=0.19% 

YU-2 

2/525=0.38% 

YU-27 

2/525=0.38% 

PCR-A7 

1/525=0 . 19% (deletion) 

PCR-B7 

3/525=0.57% 

PCR-B8 

3/525=0.57% (2 

bp  deletion) 

PCR-B9 

6/525=1. 14% (1 

stop  codon) 

PCR-B14 

1/525=0.19% 

PCR-C14 

2/525=0.38% (1 

stop  codon) 

PCR-C19 

2/525=0.38% 

Figure  43.  Summary  of  envelope  nucleotide  sequences  from  eight  HIV-1 
provlruses  in  lambda  (Yu-1,  2,  3,  4,  10,  21,  27,  and  32)  and  from  eleven  PCR 
derived  envelope  fragments  (PCR-A7,  B7,  B8,  B9,  B13,  B14,  B16,  C6,  Cl  4,  Cl  9, 
C20).  All  clones  were  derived  from  uncultured  brain  DNA  (see  text). 
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bp /bp 

NUCLEOTIDE 

AMINO  ACID 

YU-1:  1/525=0.19% 

GAT — AAT 

(Asp— Asn) 

YU-2:  2/525=0.38% 

AAG — AAT 

(Lys— Asn) 

YU-27:  2/525=0.38% 

GTA — GTC 
TCT--GCT 

(Val— Val) 
(Ser— Ala) 

PCR-A7:  1/525=0.19% 

TGG — *GG 

(Trp-lbp 

deletion) 

PCR-B7:  3/525=0.57% 

CTA — CCA 
AGA — AGG 
ACT — ACG 

(Leu—Pro) 
(Arg — Arg) 
(Thr — Thr) 

PCR-B8:  3/525=0.57% 

CTA--TTA 
ACA — A** 

(Leu--Leu) 
(Thr — 2bp 
deletion) 

PCR-B9:  6/525=1.14% 

GAG--GAA 
ACA- -ACG 
AAT — GAT 
AAG— AAT 
ACT — ACC 
TGG— TGA 

(Glu — Glu) 
(Thr — Thr) 
(Asn--Asp) 
(Lys — Asn) 
(Thr — Thr) 
(Trp—Stop) 

PCR-B14:  1/525=0.19% 

ATA — ACA 

(lie — Thr) 

PCR-C14:  2/525=0.38% 

ACT — ACC 
TGG — TGA 

(Thr — Thr) 
(Trp--Stop) 

PCR-C19:  2/525=0.38% 

CTA--TTA 

TAC--TGC 

(Leu-Leu) 

(Tyr—Cys) 

Figure  44.  Comparison  of  nucleotide  and  amino  acid  sequences  of  lambda  and 
PCR  derived  HIV-1  clones  In  relation  to  the  predominant  (consensus)  sequences 

(see  Figure  43).  All  clones  were  derived  from  uncultured  brain  DNA  (see  text). 
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